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MANNOSIDASE STRUCTURES 



A portion of the disclosure of this patent document contains material that is subject to 
copyright protection. The copyright owner has no objection to the facsimile reproduction by 
5 anyone of the patent document or patent disclosure, as it appears in the Patent and Trademark 
Office patent file or records, but otherwise reserves all copyright rights whatsoever. 

FIELD OF THE INVENTION 

10 The present invention relates to crystal structures. In particular, the invention relates to crystals 

Q comprising a mannosidase II ligand binding domain (LBD), optionally having a ligand which 

,5 is associated therewith. The structures may be used to determine mannosidase homologues 

^ and information about the secondary and tertiary structures of polypeptides which are as yet 

sij structurally uncharacterised. The structures may also be used to identify ligands which are 

15 capable of binding the ligand binding domain. Such ligands may be capable of acting as 
modulators of mannosidase II activity. 

[Jj BACKGROUND 

a H " 

'** 20 Mannosidase II enzymes 

There has been widespread interest in mannosidases in recent years, largely due to their role 
in a multitude of biological systems and, as a result, their potential as therapeutic targets. In 
particular, mammalian Golgi a-mannosidase II is involved in glycoprotein biosynthesis 
(especially in the maturation of N-linked oligosaccharides on newly synthesized 
25 glycoproteins) and is currently an important therapeutic target for the development of anti- 
cancer agents (Goss et al (1995) Clin. Cancer Res. 1 :935-944). 

Golgi a-mannosidase II (mannosyl oligosaccharide 1,3-1,6-a-mannosidase II, EC 3.2.1.114; 
also referred to herein as "GMII") belongs to the glycosyl hydrolase family 38 (Henrissat, 
30 1991; Coutinho and Henrissat, 1999) and is central to the Golgi processing pathway, as it 
specifically trims two mannose residues from the branched GlcNAcMan5GIcNAc2 mannose 
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intermediate (Figure 8A) to form the core GlcNAcMaii3GIcNAc2 glycosyl structure, an 
essential precursor for the further addition of N-acetyl-glucosamine units. GMII is a Type II 
transmembrane protein, approximately 125 kD in size, composed of a short N-terminal 
cytoplasmic tail, a single-span transmembrane domain and a large lumenal C-terminal 
5 catalytic portion (Moremen and Touster, 1985, 1986). The enzyme is highly specific for the 
presence of the single GlcNAc attached in a a 1,2 linkage to the Man al,3-Man arm of the 
GlcNAcMan 5 GlcNAc2-Asn-X substrate (Harpaz and Schachter, 1980). It removes the di- 
mannose branch (M6, M7; Figure 8A) by hydrolysis of both glycosidic bonds with net 
retention of sugar anomeric configuration, resulting in the final tri-mannose 
10 GlcNAcMan3GlcNAc2 core. There is little or no experimental evidence to date addressing 
whether the two bonds are cleaved in separate binding sites or sequentially in the same 
binding site, nor whether or not the singly-hydrolyzed product is released from the enzyme 
between the two cleavage events. 

15 Mammalian lysosomal-mannosidase has significant sequence similarity to the GM II enzyme 
and is responsible for glycoprotein degradation (Moremen et al (1994) Glycobiology 4 113- 
125; Liao et al (1996) J. Biol. Chem. 271:28348-28358). In particular, lysosomal ct- 
mannosidase II is involved in the catabolism of N-linked glycoproteins through the sequential 
degradation of high mannose, hybrid and complex oligosaccharides. 

20 

Mutations in the gene encoding mannosidase II cause ct-mannosidosis, an autosomal recessive 
lysosomal storage disease (Ockermann (1967) Lancet 2:239-241). 

A number of mannosidase II genes have been characterised from different sources, including 
25 the Drosophila gene (Foster et al (1995) Gene 154:183-186; Rabouille et al (1999) J. Cell Sci. 
112:3319-3330), rat gene (Spiro et al (1997) J. Biol. Chem. 272:29356-29363) and human, 
mouse, bovine and feline genes (Beccari et al (1999) Bioscience reports 19:158-162). These 
mannosidases have been categorized as class II mannosidases, based on sequence alignment, 
and belong to family 38 in Henrissat's glycosidase classification (Moremen et al (1994) as 
30 above, Henrissat and Bairoch (1996) Biochem J. 3 1 6:695-696). 
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To date there have been significant problems with high level expression of these enzymes, 
which has impeded structural and mechanistic studies. Indeed, problems with expression 
have meant that a-mannosidase from Jack Bean (Canavalia ensiformis) has been used as a 
5 model enzyme for structural and functional characterisation (Howard et al (1998) J. Biol. 
Chem. 273:2067-2072; Kimura et al (1999) Eur. J. Biochem. 164:168-175). In view of the 
potential therapeutic application of mannosidase inhibitors, there is a need for direct structural 
characterisation of these enzymes. 

10 Swainsonine 

Swainsonine (SW) is an indolizidine alkaloid found in Australian Swainsona canescens 
(Colegate etaL, Aust J Chem 32:2257-2264, 1979), North American plants of the genera 
Astragalus and (Molyneux R J and James L F., Science 215:190-191, 1981), and also the 
fungus Rhizoctonia leguminicola (Schneider et al., Tetrahedron 39;29-31, 1983). 

15 

Swainsonine is a potent and specific inhibitor of the lysosomal and golgi forms of alpha- 
mannosidase (Cenci di Bello et al., Biochem. J. 215, 693 (1983); Tulsiani et al., J. Biol. 
Chem. 257, 7936 (1982)). It has potential therapeutic value as an antimetastatic (Humpheries 
et al., Cancer Res. 48, 1410 (1988)), and tumor-proliferative (Dennis, Cancer Res. 46, 5131 
20 (1986)), or immunoregulatory agent (Kino et al, J. Antibiot. 38, 936 (1985)). Swainsonine 
has also been shown to have positive effects on cellular immunity in mice (reviewed in 
Humphries M J. and Olden K., Pharmacol Ther. 44:85-105, 1989, and Olden et al., 
Pharmacol Ther 50:285-290, 1991)). 

25 Structural information about the interaction between swainsonine and mannosidase II 
enzymes would provide a basis for rational modification of swainsonine derivatives with 
altered activities. It would also provide a framework on which new ligands could be designed 
which mimic some of the swainsonine:mannosidase atomic interactions. 
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SUMMARY OF THE INVENTION 

The present invention is based on the finding that, after extensive modifications to the 
protocol, it is possible to express mannosidase II in appreciable quantities. The present 
5 invention is also based on the finding that it is possible to crystallize the protein mannosidase 
II, both alone and in combination with a selection of different ligands. More particularly, it 
has been possible to identify the specific sites of mannosidase II which are associated with 
binding to swainsonine and the mannose-like compound deoxymannojirimycin (DMNJ). The 
structure was also shown to exhibit a previously unobserved folding pattern enabling the 
10 design of novel GMII-specific inhibitors. 

v 3 Binding domains are of significant utility in drug discovery. The association of natural ligands 

and substrates with the binding domains of mannosidases is the basis of many biological 
mechanisms. In addition, many drugs (e.g. swainsonine) exert their effects through 
if: 15 association with the binding domains of mannosidases. The associations may occur with all or 
any parts of a binding domain. An understanding of these associations will lead to the design 
;f and optimization of drugs having more favorable associations with their target enzyme and 

U thus provide improved biological effects. Therefore, information about the shape and structure 

s r, £ of mannosidases and their ligand-binding domains is invaluable in designing potential 

20 modulators of mannosidases for use in treating diseases and conditions associated with or 
modulated by the mannosidases. 

Thus, according to a first aspect of the invention, there is provided a crystal comprising a 
mannosidase II ligand-binding domain. In a preferred embodiment the crystal is a crystal of a 
25 mannosidase II enzyme. The structure of a crystal of mannosidase II has been solved and is 
set forth in Table 1, Table 2, or Table 8. 

The crystal may comprise a complex between a mannosidase II ligand-binding domain and at 
least one ligand, for example an inhibitor of mannosidase II. In a particularly preferred 
30 embodiment that crystal comprises a complex between mannosidase II and swainsonine. The 
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structure of a crystal of a complex between mannosidase II and swainsonine has been solved, 
and is set forth in Table 2 or Table 8. 

In a second aspect, the present invention provides a crystal comprising swainsonine or a 
5 derivative thereof. In a preferred embodiment, the crystal comprises a complex between 
swainsonine (or a derivative thereof) and a mannosidase II ligand-binding domain. The 
structure of a crystal of a complex between mannosidase II and swainsonine has been solved, 
and is set forth in Table 2, or Table 8. 

10 According to a third aspect of the invention, there is provided a model of at least part of a 
mannosidase II, made using a crystal according to the first aspect of the invention. In a 
preferred embodiment, the model comprises the mannosidase II ligand-binding domain. 
There is also provided a model of swainsonine or a derivative thereof made using a crystal 
according to the second aspect of the invention. 

15 

The crystal of the first and second aspect of the invention and a model of the third aspect of 
the invention may be provided in the form of a computer readable medium. 

The crystals and models of earlier aspects of the invention may provide information about the 
20 atomic contacts involved in the interaction between the enzyme and a known ligand, which 
can be used to screen for unknown ligands. According to a fourth aspect of the invention, 
there is provided a method of screening for a ligand capable of binding a mannosidase II 
ligand binding domain, comprising the use of a crystal according to the first or second aspects 
of the invention or a model according to the third aspect of the invention. For example, the 
25 method may comprise the step of contacting the ligand binding domain with a test compound, 
and determining if said test compound binds to said ligand binding domain. 

In a fifth aspect, the present invention provides a ligand identified by a screening method of 
the fourth aspect of the invention. Preferably the ligand is a modulator that is capable of 
30 modulating the activity of a mannosidase II enzyme. 
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A crystal and/or model of the invention may be used to design, evaluate, and identity 
modulators of a mannosidase II or homologues thereof other than ligands that associate with a 
mannosidase II. The modulators may be based on the shape and structure of a mannosidase II, 
5 or a ligand binding domain or atomic interaction, or atomic contacts thereof. Therefore 
modulators may be derived from ligand binding domains or analogues or parts thereof. 

Modulators (e.g. ligands) which are capable of modulating the activity of mannosidase II 
enzymes have considerable therapeutic and prophylactic potential. In a sixth aspect, the 
10 present invention provides the use of a modulator of the invention in the manufacture of a 

0 medicament to treat and/or prevent a disease in a mammalian patient. There is also provided 
,S a pharmaceutical composition comprising a modulator and a method of treating and/or 
j:^ preventing a disease comprising the step of administering such a modulator or pharmaceutical 
IU composition to a mammalian patient. 

1 15 

^ A potential modulator of a mannosidase II identified by a method of the present invention 

may be confirmed as a modulator by synthesizing the compound, and testing its effect on the 
jj * enzymatic activity of mannosidase II in an assay. Such assays are known in the art. 

20 Therefore, the methods of the invention for identifying ligands or modulators may comprise 
one or more of the following additional steps: 

(a) testing whether the modulator or ligand is a modulator of the activity of a mannosidase 
II, preferably testing the activity of the modulator or ligand in cellular assays and 
animal model assays; 
25 (b) modifying the modulator or ligand; 

(c) optionally rerunning steps (a) or (b); and 

(d) preparing a pharmaceutical composition comprising the modulator or ligand. 

Steps (a), (b) (c) and (d) may be carried out in any order, at different points in time, and they 
need not be sequential. 
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The crystal structures and models described above also provide information about the 
secondary and tertiary structure of mannosidase II enzymes. This can be used to gleen 
structural information about other, previously uncharacterised polypeptides. According to a 
seventh aspect of the invention there is provided a method of determining the secondary 
5 and/or tertiary structures of polypeptides with unknown (or only partially known) structure 
comprising the step of using such a crystal or model. The polypeptide under investigation is 
preferably structurally or functionally related to the mannosidase II enzyme. For example, the 
polypeptide may show a degree of homology over some or all parts of the primary amino acid 
sequence. Alternatively, the polypeptide may perform an analogous function or be suspected 
10 to show a similar catalytic mechanism to the mannosidase II enzyme. 

Aspects of the invention are presented in the accompanying claims and in the following 
description, drawings, and Tables. 

15 DESCRIPTION OF THE FIGURES AND TABLES 

The present invention will now be described only by way of example and with reference to 
the accompanying figures and tables, wherein: 

20 Figure 1 shows the active site of mannosidase II. 

Figure 2 shows the secondary structure of Drosophila Golgi a-mannosidase II. Helices are in 
blue and p sheets are in red. 

25 Figure 3 shows the Drosophila golgi a-mannosidase II molecule with the colours representing 
where it is identical to human GMII. The red and blue represent deletions or insertions with 
respect to the human sequence. The green is a disulphide bond. 
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Figure 4 shows the whole Drosophila golgi a-mannosidase II molecule in sticks with residues 
that are identical in the lysosomal manll as coloured balls (red or blue depending whether 
they are in the N-terminal or C-terminal part of the molecule). 

Figure 5 shows the active site of a Drospholiga mannosidase. 

Figure 6 shows the DNA sequence of an expressed Drosophila mannosidase. 

Figure 7 shows an alignment of expressed secreted Drosophila mannosidase with human 
mannosidase. 

Figure 8 shows A). Schematic representation of the high mannose GIcNAcMan 5 GlcNAc 2 
substrate of dGMII. B) Ribbon representation of the dGMII structure, top- view, C) side-view. 
The loop formed by residues 527-540 is shown in yellow. All molecular images were 
prepared using MOLSCRIPT (Kraulis, 1991) and rendered using Raster3D (Merritt and 
Bacon, 1997) 

Figure 9 shows a molecular surface representation of the convex face (A) and the planar face 
(B) of the dGMII molecule. Molecular surface images are colored for electrostatic potential 
(red for negative, blue for positive). C) Molecular surface representation of the planar face of 
dGMII, colored for homology with the sequence of human Golgi a-mannosidase II (dark- 
green for identical, light-green for homologous, yellow for similar, and white for different 
residues). Alignment of human and Drosophila Golgi a-mannosidase II sequences (SwissProt 
accession numbers Q 16706 and Q24451, respectively) was performed using the GAP 
program of the Wisconsin package (Version 1 0, Genetics Computer Group) using the default 
parameters without any manual intervention. The scores were used to colour the molecular 
surface. All molecular surface images were produced using GRASP (Nicholls et al., 1991). 

Figure 10 shows stereo views of the active site of dGMII with bound Tris (A), DMNJ (B), and 
swainsonine (C) molecules. The active site zinc ion is shown in turquoise, the bound inhibitor 
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molecules are rendered in gold and water molecules are represented as transparent red 
spheres. Hydrogen bonds are shown as blue dashed lines. 

Figure 1 1 shows A) Molecular surface representation of dGMII showing the position of the 
5 active site bound Tris molecule and the 2-methyl-2,4-pentanediol (MPD) binding site. B) 
Molecular surface representation of dGMII with the GlcNAcMansGlcNAc2 substrate modeled 
into the binding pocket. The substrate molecule is positioned into the binding pocket with 
al,6-linked mannose M6 (shown in green) docked into the active site and pi,2-GlcNAc 
residue G3 (shown in black) placed in the MPD binding site. Individual mannose residues of 
10 the substrate are colored according to the coloring scheme used in Figure 8A. C) 
% Representation of the sequential trimming of the a 1,6 (M6) and al,3-Hnked (M7) mannose 

? 0 residues. Figure 11A was produced using LIGPLOT (Wallace et al., 1995). All molecular 

q surface images were produced using GRASP (Nicholls et al., 1991). 

e Ha? 

P 15 Table 1 shows the structural coordinates of a Drosophila Golgi a-mannosidase II. 

r fs Table 2 shows the structural coordinates of a Drosophila Golgi a-mannosidase II with 

8;f swainsonine. 

f ■ 

20 Table 3 shows the ligand binding domain (active site) of a mannosidase II. 

Table 4 shows the intermolecular contacts of a Drosophila Golgi a-mannosidase II 
swainsonine complex. 

25 Table 5 shows crystallographic refinement statistics for the native Drosophila Golgi 
mannosidase IL 

Table 6 shows crystallographic refinement statistics for Drosophila Golgi mannosidase II 
associated with swainsonine. 
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Table 7 shows a list of Mannosidase II enzymes. 

Table 8 shows the structural coordinates of a Drosophila Golgi ct-mannosidase II with 
swainsonine, a zinc ion, Tris molecule and an N-glycan. 

5 

Table 9 shows data collection statistics for MAD (Se-Met) of dGMII and native dGMII. 

Table 10 shows refinement statistics of dGMII, dGMII-swainsonine complex, and dGMII- 
DMNJ complex. 

10 

In Tables 1, 2, and 8 from the left, the second column identifies the atom number; the third 
identifies the atom type; the fourth identifies the amino acid type; the sixth identifies the residue 
number; the seventh identifies the x coordinates; the eighth identifies the y coordinates; the 
ninth identifies the z coordinates; the tenth identifies the occupancy; and the eleventh identifies 
15 the temperature factor. 

DETAILED DESCRIPTION OF THE INVENTION 

Unless otherwise indicated, all terms used herein have the same meaning as they would to one 
20 skilled in the art of the present invention. Practitioners are particularly directed to Current 
Protocols in Molecular Biology (Ansubel) for definitions and terms of the art. Abbreviations 
for amino acid residues are the standard 3-letter and/or 1 -letter codes used in the art to refer to 
one of the 20 common L-amino acids, 

25 In a first aspect, the present invention relates to a crystal comprising a mannosidase II ligand 
binding domain. 

Crystal 

As used herein, the term "crystal" means a structure (such as a three dimensional (3D) solid 
30 aggregate) in which the plane faces intersect at definite angles and in which there is a regular 
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structure (such as internal structure) of the constituent chemical species. Thus, the term 
"crystal" can include any one of: a solid physical crystal form such as an experimentally 
prepared crystal, a crystal structure derivable from the crystal (including secondary and/or 
tertiary and/or quaternary structural elements), a 2D and/or 3D model based on the crystal 
5 structure, a representation thereof such as a schematic representation thereof or a diagrammatic 
representation thereof, or a data set thereof for a computer. 



In one aspect, the crystal is usable in X-ray crystallography techniques. Here, the crystals used 
can withstand exposure to X-ray beams used to produce a diffraction pattern data necessary to 
10 solve the X-ray crystallographic structure. A crystalline form of a mannosidase, may be 
characterized as being capable of diffracting x-rays in a pattern defined by one of the crystal 

is JJ 

kQ : forms depicted in Blundel et al 1976, Protein Crystallography, Academic Press. 

W A crystal of the invention includes a mannosidase II or part thereof (e.g. ligand binding 

IJ1 15 domain) in association with one or more moieties, including heavy-metal atoms i.e. a 
derivative crystal, a metal cofactor, or one or more ligands or substrates i.e. a co-crystal. 



The term "associate", "association" or "associating" refers to a condition of proximity 
between a moiety (i.e. chemical entity or compound or portions or fragments thereof), and a 
20 mannosidase II, or parts or fragments thereof (e.g. binding sites or domains). The association 
may be non-covalent i.e. where the juxtaposition is energetically favoured by for example, 
hydrogen-bonding, van der Waals, or electrostatic or hydrophobic interactions, or it may be 
covalent. 



25 The term "heavy-metal atoms" refers to an atom that can be used to solve an x-ray 
crystallography phase problem, including but not limited to a transition element, a lanthanide 
metal, or an actinide metal. Lanthanide metals include elements with atomic numbers between 
57 and 71, inclusive. Actinide metals include elements with atomic numbers between 89 and 
103, inclusive. 
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Multiwavelength anomalous diffraction (MAD) phasing may be used to solve protein 
structures using selenomethionyl (SeMet) proteins. Therefore, a complex of the invention may 
comprise a crystalline mannosidase 11 or part thereof (e.g. ligand binding domain) with 
selenium associated with the methionine residues of the protein. 

In an embodiment of the invention, a ligand binding domain is in association with a metal 
cofactor in the crystal. A "metal cofactor" refers to a metal required for mannosidase activity 
and/or stability. For example, the metal cofactor may be zinc, and other similar atoms or 
metals. In a preferred embodiment a LBD is in association with Zn 2+ . 

A ligand binding domain in a complex with a cofactor preferably comprises one or more of 
the residues involved in coordination of a Zn 2+ ion, namely: aspartate residues 92 and 204, 
and histidines 90 and 471. 



15 The crystal may comprise a complex between a ligand-binding domain and one or more 
ligands. In other words the ligand binding domain may be associated with one or more 
ligands in the crystal. The ligand may be any compound which is capable of interacting 
stably and specifically with the ligand binding domain. The ligand may, for example, be an 
inhibitor of mannosidase II, including but not limited to swainsonine and the mannose-like 

20 compound deoxymannojirimycin (DMNJ). 

In a preferred embodiment the ligand associated with said mannosidase II ligand binding 
domain is swainsonine, or an analogue or derivative thereof. Swainsonine is an indolizidine 
alkaloid found in a variety of sources (Colegate et al., (1979); Molyneux and James (1981); 
25 and Schneider et al. (1983) all as above) which has been known to be an inhibitor of 
mannosidase II enzymes for some time. Derivatives of swainsonine are also known in the art, 
for example US 5962467, US 5,650,413, and U.S. 6,048,870, describe various derivatives of 
swainsonine, processes for their preparation and their use as therapeutic agents. 
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In an embodiment a crystal of the invention comprises a ligand binding domain of a 
mannosidase II in association with swainsonine. These complexes may have the structural 
coordinates shown in Table 2, or Table 8. 

In a second aspect, the present invention also provides a crystal comprising swainsonine or a 
derivative thereof. Preferably the swainsonine molecule has the three dimensional structure 
defined by the relevant structural coordinates shown in Table 2, or Table 8, 

The crystal may also comprise a complex between mannosidase II (or part thereof) and a 
substrate, or analogue thereof. The term "substrate" refers to molecules that associate with a 
mannosidase II as it hydrolyzes linkages between mannose residues. Mannosidases II enzymes 
release a-D-mannose as a first formed product and they follow a double-displacement 
mechanism in which a glycosyl-enzyme intermediate is formed and hydrolyzed via 
oxocarbenium ion-like transition states. The formation of the intermediate is assisted by general 
acid catalysis from a carboxylic acid located in the active site. The residue also serves as the 
general base catalyst for the second deglycosylation step. A second carboxylic acid serves as the 
nucleophile that forms the covalent intermediate. Thus, the substrate molecule may comprise 
molecules such as the glycosyl moiety that forms an intermediate with the enzyme. (See 
Howard, S. et al, J. Biol. Chem. (1998) 273. 2067-2072 and references 11, 12, 14, 15, and 16 
therein). An analogue of a substrate is one which mimics the substrate binding in the LBD, but 
which is incapable (or has a significantly reduced capacity) to take part in the catalytic reaction. 

A number of substrates for Golgi a-mannosidase II are known including the artificial 
substrate PNP-mannose (Rabouille et al (1999) as above). Lysosomal mannosidase II is 
involved in glycoprotein degradation. In particular lysosomal mannosidase II hydrolyses 
a(l,2) a(l,3) and a(l,6) linkages betwwen mannose residues. Substrates for this enzyme are 
thought to include high mannose, hybrid and complex oligosaccharides. 

In an embodiment, the substrate comprises GlcNAcMan 5 GlcNAc2-Asn-. 
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A complex may comprise one or more of the intermolecular interactions identified in Table 4. 
A structure of a complex of the invention may be defined by selected intermolecular contacts, 
preferably the intermolecular contacts as defined in Table 4. 

5 A crystal of the invention may be characterized by an N-terminal a/p domain, a C-terminal 
portion comprising a three-helical bundle, and an all-p C-terminal domain, connected by 5 
internal disulfide bonds and stabilized by a zinc binding site (Figure 8B). 

The N-terminal a/p domain is characterized as follows: 
10 (a) comprising an inner core of three p-sheets (A, B and C, Figure 8B) consisting of 1 1 , 
mostly parallel P-strands, surrounded by 16 a-helices; 

(b) comprising a GlcNAc residue at a consensus N-glycosylation site (Asn-194), located 
at the N-terminus of helix 7. 

(c) stabilized by three disulfide bonds: between Cys-31 and Cys-1032 connecting the N 
15 and C-terminal extremes of dGMII; Cys-275 and Cys-282 linking helices 10 and 11; 

Cys-283 and Cys-297 linking helix 1 1 with a loop between helix 13 and the core of 
parallel p-sheets. 

The C-terminal portion is characterized as follows: 
20 (a) a three-helix bundle comprises helices 18, 20 and 21 connected to the N-terminal a/p- 
domain via a zinc binding site. 

(b) a zinc ion coordinated in a Ts-square-based pyramidal geometry involving residues: 
Asp-90, His-92, Asp-204 and His-471. 

(c) two immunoglobulin-like domains: a small P-sandwich consisting of 12 anti-parallel 
25 strands from p-sheets D and E, and a large 21 -strand structure involving p-sheets F 

andG. 

(d) a barrel formed by the three-helix bundle, helix-23, and the two p-sandwich structures 
providing a narrow pore in the center of the C-terminal domain. 
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The barrel in the C-terminal portion is lined by six arginine residues: Arg-540, 565, 617, 770, 
777 and 893, contributing to the overall positive charge of the pore (Figure 9A). A hairpin 
loop, connecting two strands of p-sheet D (Figure 8B and C, residues 527-540, shown in 
yellow) protrudes into the center of the barrel on the planar side of the molecule. Arginine 
5 residue 530, located at the tip of the type-I P-turn in this loop, plugs the pore preventing an 
open channel through the protein. The resulting crater-like cavity on the convex side of the 
molecule is 20 A deep, with a diameter of 20 A fiinneling to 8 A at the bottom of the cavity. 
The loop has a higher degree of flexibility compared to the rest of the structure (average B- 
factor values: -33 A 2 and -15A 2 , respectively). 

10 

A crystal of the invention may enable the determination of structural data for a ligand or 
substrate. In order to be able to derive structural data for the ligand or substrate, it is necessary 
for the molecule to have sufficiently strong electron density to enable a model of the molecule 
to be built using standard techniques. For example, there should be sufficient electron density 
15 to allow a model to be built using XTALVIEW (McRee 1992 J. Mol. Graphics. 10 44-46). 

Preferably, the crystal of the invention belongs to space group P2i2i2j. 

The term "space group" refers to the lattice and symmetry of the crystal. In a space group 
20 designation the capital letter indicates the lattice type and the other symbols represent 
symmetry operations that can be carried out on the contents of the asymmetric unit without 
changing its appearance. 

Preferably, a crystal of said complex comprises a unit cell having the following unit 
25 dimensions: a=69 (±5) A, b=l 10 (±5) A, c=139 (±5) A. 

The term "unit cell" refers to the smallest and simplest volume element (i.e. parallelpiped- 
shaped block) of a crystal that is completely representative of the unit of pattern of the crystal. 
The unit cell axial lengths are represented by a, b, and c. Those of skill in the art understand 
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that a set of atomic coordinates determined by X-ray crystallography is not without standard 
error. 

In a highly preferred embodiment, the crystal comprises the structural coordinates as shown in 
5 Table 1 , Table 2, or Table 8. 

As used herein, the term "structural coordinates" refer to a set of values that define the 
position of one or more amino acid residues with reference to a system of axes. The term 
refers to a data set that defines the three dimensional structure of a molecule or molecules 

10 (e.g. Cartesian coordinates, temperature factors, and occupancies). Structural coordinates can 
be slightly modified and still render nearly identical three dimensional structures. A measure 
of a unique set of structural coordinates is the root-mean-square deviation of the resulting 
structure. Structural coordinates that render three dimensional structures (in particular a three 
dimensional structure of an SGC domain) that deviate from one another by a root-mean- 

15 square deviation of less than 5 A, 4 A, 3 A, 2 A, or 1.5 A may be viewed by a person of 
ordinary skill in the art as very similar. 

Variations in structural coordinates may be generated because of mathematical manipulations 
of the structural coordinates of a mannosidase described herein. For example, the structural 
20 coordinates of Table 1, 2, or 8 may be manipulated by crystallographic permutations of the 
structural coordinates, fractionalization of the structural coordinates, integer additions or 
substractions to sets of the structural coordinates, inversion of the structural coordinates or 
any combination of the above. 

25 Variations in the crystal structure due to mutations, additions, substitutions, and/or deletions 
of the amino acids, or other changes in any of the components that make up the crystal may 
also account for modifications in structural coordinates. If such modifications are within an 
acceptable standard error as compared to the original structural coordinates, the resulting 
structure may be the same. Therefore, a ligand that bound to a ligand binding domain of a 

30 mannosidase would also be expected to bind to another ligand binding domain whose 
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structural coordinates defined a shape that fell within the acceptable error. Such modified 
structures of a ligand binding domain thereof are also within the scope of the invention. 

Various computational analyses may be used to determine whether a molecule or the ligand 
binding domain thereof is sufficiently similar to all or parts of a ligand binding domain 
thereof. Such analyses may be carried out using conventional software applications and 
methods as described herein. 

The crystal may also be specifically characterised by the refinement statistics set out in Tables 
5, 6, or 10. 

MANNOSIDASE II 

The term "mannosidase II" refers to eukaryotic mannosidases involved in the biosynthesis of 
glycoproteins, glycolipids, glycosylphosphatidylinositols and other complex glycoconjugates, 
and prokaryotic mannosidases involved in the synthesis of carbohydrate structures of bacteria 
and viruses. In particular, the term refers to the class of mannosidases categorized as class II 
mannosidases, based on sequence alignment, belonging to family 38 in Henrissat' s 
glycosidase classification (Moremen, K.W. et al (1994) GlycoBiology 4, 1 13-125; Henrissat, 
B. and Bairoch A. (1996) Biochem J. 316, 695-696; Henrissat, B. and Bairoch A. (1993) 
Biochem J. 293, 781-788; Henrissat, B. and Bairoch A. (1991) Biochem J. 280, 309-316). 
Examples of mannosidase II enzymes include those listed in Table 7 (from http://afmb.cnrs- 
mrs.fr/-pedro/CAZY/ghf_38.html). 

The invention generally relates to mannosidase II enzymes and parts thereof. Mannosidase II 
enzymes catalyze the first committed step in the biosynthesis of complex N-glycans and they 
control conversion of high mannose to complex N-glycans. 

Mannosidases are derivable from a variety of sources, including viruses, bacteria, fungi, 
plants, and animals. In a preferred embodiment the glycosyltransferase is derivable from an 
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animal, preferably a mammal including but not limited to bovine, ovine, porcine, murine 
equine, most preferably a human. The enzyme may be from any source, whether natural, 
synthetic, semi-synthetic, or recombinant. 

5 A mannosidase or part thereof in the present invention may be a wild type enzyme, or part 
thereof, or a mutant, variant or homologue of such an enzyme. 

The term "wild type" refers to a polypeptide having a primary amino acid sequence which is 
identical with the native enzyme (for example, the mammalian enzyme). 

10 

The term "mutant" refers to a polypeptide having a primary amino acid sequence which 
differs from the wild type sequence by one or more amino acid additions, substitutions or 
deletions. Preferably, the mutant has at least 90% sequence identity with the wild type 
sequence. Preferably, the mutant has 20 mutations or less over the whole wild-type sequence. 
15 More preferably the mutant has 10 mutations or less, most preferably 5 mutations or less over 
the whole wild-type sequence. 

The term "variant" refers to a naturally occurring polypeptide which differs from a wild-type 
sequence. A variant may be found within the same species (i.e. if there is more than one 
20 isoform of the enzyme) or may be found within a different species. Preferably the variant has 
at least 90% sequence identity with the wild type sequence. Preferably, the variant has 20 
mutations or less over the whole wild-type sequence. More preferably, the variant has 10 
mutations or less, most preferably 5 mutations or less over the whole wild-type sequence. 

25 The term "part" indicates that the polypeptide comprises a fraction of the wild-type amino 
acid sequence. It may comprise one or more large contiguous sections of sequence or a 
plurality of small sections. In an embodiment, the "part" comprises a wild type mannosidase 
enzyme with the cytosolic and transmembrane domains and most of the stalk region 
eliminated, preferably the "part" comprises amino acid residues 31-1044 of Golgi ct- 

30 mannosidase. The "part" may comprise a ligand binding domain as described herein. The 
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polypeptide may also comprise other elements of sequence, for example, it may be a fusion 
protein with another protein (such as one which aids isolation or crystallisation of the 
polypeptide). Preferably the polypeptide comprises at least 50%, more preferably at least 
65%, most preferably at least 80% of the wild-type sequence. 

5 

The term "homologue" means a polypeptide having a degree of homology with the wild-type 
amino acid sequence. The term "homology" can be equated with "identity". 

In the present context, an homologous sequence is taken to include an amino acid sequence 
10 which may be at least 75, 85 or 90% identical, preferably at least 95 or 98% identical to the 
wild-type sequence. Typically, the homologues will comprise the same sites (for example 
ligand binding domain) as the subject amino acid sequence. Although homology can also be 
considered in terms of similarity (i.e. amino acid residues having similar chemical 
properties/functions), in the context of the present invention it is preferred to express 
15 homology in terms of sequence identity. 

Homology comparisons can be conducted by eye, or more usually, with the aid of readily 
available sequence comparison programs. These commercially available computer programs 
can calculate % homology between two or more sequences. 

20 

Percentage homology may be calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each amino acid in one sequence is directly compared 
with the corresponding amino acid in the other sequence, one residue at a time. This is called 
an "ungapped" alignment. Typically, such ungapped alignments are performed only over a 
25 relatively short number of residues. 

Although this is a very simple and consistent method, it fails to take into consideration that, 
for example, in an otherwise identical pair of sequences, one insertion or deletion will cause 
the following amino acid residues to be put out of alignment, thus potentially resulting in a 
30 large reduction in % homology when a global alignment is performed. Consequently, most 
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sequence comparison methods are designed to produce optimal alignments that take into 
consideration possible insertions and deletions without penalising unduly the overall 
homology score. This is achieved by inserting "gaps" in the sequence alignment to try to 
maximise local homology. 

5 

However, these more complex methods assign "gap penalties" to each gap that occurs in the 
alignment so that, for the same number of identical amino acids, a sequence alignment with as 
few gaps as possible - reflecting higher relatedness between the two compared sequences - 
will achieve a higher score than one with many gaps. "Affine gap costs" are typically used 

10 that charge a relatively high cost for the existence of a gap and a smaller penalty for each 
subsequent residue in the gap. This is the most commonly used gap scoring system. High 
gap penalties will of course produce optimised alignments with fewer gaps. Most alignment 
programs allow the gap penalties to be modified. However, it is preferred to use the default 
values when using such software for sequence comparisons. For example when using the 

15 GCG Wisconsin Bestfit package the default gap penalty for amino acid sequences is -12 for a 
gap and -4 for each extension. 

Calculation of maximum % homology therefore firstly requires the production of an optimal 
alignment, taking into consideration gap penalties. A suitable computer program for carrying 

20 out such an alignment is the GCG Wisconsin Bestfit package (University of Wisconsin, 
U.S.A.; Devereux et al, 1984, Nucleic Acids Research 12:387). Examples of other software 
than can perform sequence comparisons include, but are not limited to, the BLAST package 
(see Ausubel et al, 1999 ibid - Chapter 18), FASTA (Atschul et al, 1990, J. Mol. Biol., 403- 
410) and the GENE WORKS suite of comparison tools. Both BLAST and FASTA are 

25 available for offline and online searching (see Ausubel et ai, 1999 ibid, pages 7-58 to 7-60). 
However, for some applications, it is preferred to use the GCG Bestfit program. A new tool, 
called BLAST 2 Sequences is also available for comparing protein and nucleotide sequence 
(see FEMS Microbiol Lett 1999 174(2): 247-50; FEMS Microbiol Lett 1999 177(1): 187-8 
and tatiana@ncbi.nlm.nih.gov). 



30 
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Although the final % homology can be measured in terms of identity, the alignment process 
itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity 
score matrix is generally used that assigns scores to each pairwise comparison based on 
chemical similarity or evolutionary distance. An example of such a matrix commonly used is 
the BLOSUM62 matrix - the default matrix for the BLAST suite of programs. GCG 
Wisconsin programs generally use either the public default values or a custom symbol 
comparison table if supplied (see user manual for further details). For some applications, it is 
preferred to use the public default values for the GCG package, or in the case of other 
software, the default matrix, such as BLOSUM62. 

Once the software has produced an optimal alignment, it is possible to calculate % homology, 
preferably % sequence identity. The software typically does this as part of the sequence 
comparison and generates a numerical result. 

The sequences may have deletions, insertions or substitutions of amino acid residues which 
produce a silent change and result in a functionally equivalent enzyme. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the 
secondary binding activity of the substance is retained. For example, negatively charged 
amino acids include aspartic acid and glutamic acid; positively charged amino acids include 
lysine and arginine; and amino acids with uncharged polar head groups having similar 
hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, 
glutamine, serine, threonine, phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example according to the Table below. Amino 
acids in the same block in the second column and preferably in the same line in the third 
column may be substituted for each other: 
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ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar - uncharged 


CSTM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




HFWY 



The polypeptide may also have a homologous substitution (substitution and replacement are 
5 both used herein to mean the interchange of an existing amino acid residue, with an 
alternative residue) i.e. like-for-like substitution such as basic for basic, acidic for acidic, 
polar for polar etc. Non-homologous substitution may also occur i.e. from one class of 
residue to another or alternatively involving the inclusion of unnatural amino acids such as 
ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter referred to 
10 as B), norleucine ornithine (hereinafter referred to as O), pyriylalanine, thienylalanine, 
naphthylalanine and phenylglycine. 

Replacements may also be made by unnatural amino acids include; alpha* and alpha- 
disubstituted* amino acids, N-alkyl amino acids*, lactic acid*, halide derivatives of natural 

15 amino acids such as trifluorotyrosine*, p-CI-phenylalanine*, p-Br-phenylalanine*, p-I- 
phenylalanine*, L-allyl-glycine*, 6-alanine*, L-a-amino butyric acid*, L-y-amino butyric 
acid*, L-a-amino isobutyric acid*, L-s-amino caproic acid # , 7-amino heptanoic acid*, L- 
methionine sulfone**, L-norleucine*, L-norvaline*, p-nitro-L-phenylalanine*, L- 
hydroxyproline\ L-thioproIine*, methyl derivatives of phenylalanine (Phe) such as 4-methyI- 

20 Phe*, pentamethyi-Phe*, L-Phe (4-amino) # , L-Tyr (methyl)*, L-Phe (4-isopropyl)*, L-Tic 
(1,2,3,4-tetrahydroisoquinolineo-carboxyl acid)*, L-diaminopropionic acid n and L-Phe (4- 
benzyl)*. The notation * has been utilised for the purpose of the discussion above (relating to 
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homologous or non-homologous substitution), to indicate the hydrophobic nature of the 
derivative whereas # has been utilised to indicate the hydrophilic nature of the derivative, #* 
indicates amphipathic characteristics. 

5 Variant amino acid sequences may include suitable spacer groups that may be inserted 
between any two amino acid residues of the sequence including alkyl groups such as methyl, 
ethyl or propyl groups in addition to amino acid spacers such as glycine or P-alanine residues. 
A further form of variation, involving the presence of one or more amino acid residues in 
peptoid form, will be well understood by those skilled in the art. For the avoidance of doubt, 
10 "the peptoid form" is used to refer to variant amino acid residues wherein the a-carbon 
substituent group is on the residue's nitrogen atom rather than the a-carbon. Processes for 
preparing peptides in the peptoid form are known in the art, for example Simon RJ et aL, 
PNAS (1992) 89(20), 9367-9371 and Horwell DC, Trends Biotechnol. (1995) 13(4), 132-134. 



P 15 LIGAND-BINDING DOMAIN 

!l T ..Ji 

As used herein, the term "ligand binding domain (LBD) ,f refers to a region of a molecule or 
M;- molecular complex that as a result of its shape, favourably associates with a ligand or a part 

thereof. For example, it may be a region of a mannosidase that is responsible for binding a 
20 substrate or modulator (e.g. swainsonine). With reference to the crystal of the present 
invention residues in the LBD may be defined by their spatial proximity to the ligand (for 
example swainsonine or substrate) in the crystal structure. 

"Ligand" refers to a compound or entity that associates with a ligand binding domain, 
25 including substrates or analogues or parts thereof, or modulators of a mannosidase including 
inhibitors. A ligand may be designed rationally by using a model according to the present 
invention. 
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The term "ligand binding domain (LBD)" also includes a homologue of the ligand binding 
domain or a portion thereof. 
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As used herein, the term "homologue" in reference to a ligand binding domain refers to ligand 
binding domain or a portion thereof which may have deletions, insertions or substitutions of 
amino acid residues as long as the binding specificity of the molecule is retained. In this 
5 regard, deliberate amino acid substitutions may be made on the basis of similarity in polarity, 
charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues as long as the binding specificity of the ligand binding domain is retained. 

As used herein, the term "portion thereof means the structural coordinates corresponding to a 
10 sufficient number of amino acid residues of the mannosidase II LBD (or homologues thereof) 
that are capable of interacting with a test compound capable of binding to the LBD. This term 
.'2 includes mannosidase II ligand binding domain amino acid residues having an amino acid 

residues from about 4A to about 5A of a bound compound or fragment thereof. Thus, for 
f j example, the structural coordinates provided in the crystal structure may contain a subset of 

15 the amino acid residues in the LBD which may be useful in the modelling and design of 
- compounds that bind to the LBD. 

A ligand binding domain may be defined by its association with a ligand. With reference to a 
O crystal of the present invention, residues in the LBD may be defined by their spatial proximity 

20 to a ligand in the crystal structure. For example, such may be defined by their proximity to a 
substrate or modulator (e.g. swainsonine). 

The active site of a mannosidase II crystal of the invention may be characterized as follows: 
(a) a small cavity lined by aromatic residues Trp-95, Phe-206, Tyr-269 and Tyr-727; 
25 (b) a zinc ion binding site within the cavity characterized by a Ts-square-based pyramidal 
geometry and 'elec-His-Zn motifs'. 

A binding domain for a GMII inhibitor such as swainsonine and DMNJ, comprises one or 
more of Trp-95, Phe-206 and Tyr-727 which form a binding cavity for the inhibitor. The 
30 inhibitor ring structures can be stacked against Trp-95, and stabilized by hydrogen bonds and 
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interactions with the zinc ion. When bound to an inhibitor the zinc ion binding domain of the 
GMII can be transformed into Te-octahedral coordination. The binding domain allows for the 
formation of a hydrogen bond between the zinc-coordinating OD1 oxygen of Asp-204 and the 
N4 nitrogen at the fusion of the five and six-membered rings of swainsonine. The zinc 
5 coordinating oxygen atoms of the inhibitors are involved in hydrogen bond interactions with 
the neighboring metal binding residues of the enzyme. 

The position of the inhibitor molecules is stabilized in the active site by hydrogen bonds 
between carboxylic oxygens OD1 and OD2 of residue Asp-472 and hydroxy 1 oxygens 03 and 
10 04 (05 in swainsonine) of the inhibitors. DMNJ is involved in additional hydrogen bonds, via 
water molecules, with the NH2 nitrogen of Arg-228, the hydroxyl oxygen of Tyr-269, the 
backbone carbonyl oxygen of Arg-876, and the 0D1 oxygen of Asp-204. 

In an embodiment, a ligand binding domain comprises one or more of the following amino 
15 acid residues: His 471, His 90, and Asp 92, and Asp 204; or a homologue thereof 

In a second embodiment, a ligand binding domain comprises one or more of the following 
amino acid residues: Trp-95, Phe-206, Tyr-269, and Tyr-727. 

20 In another embodiment, a ligand binding domain comprises one or more of the following 
amino acid residues: Asp-92, Asp-204, His-90, His-471. 

In still another embodiment, a ligand binding domain comprises one or more of the following 
amino acid residues: His 471, Asp 204, Asp 341, His 90, Asp 92, Asp 472, Phe 206, Tyr 727 
25 and Trp 95; or a homologue thereof 

In yet another embodiment a ligand binding domain comprises one or more of the following 
groups: 
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(a) GVWKQG (residues 60-65) 
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(b) VFVVPHSHND (residues 83-92) 

(c) WAIDPFGH (residues 20 1 -208) 

(d) HMMPFYS YDIPHTCGPDPK v /iCCQFDFKR (residues 262-289) 

(e) LLV A PLGDDFR (residues 334-343): 

5 

In an aspect of the invention, a ligand binding domain comprises one or more of the enzyme 
residues shown in Table 3 and/or Table 4. 



A crystal of a binding domain may be defined by selected atomic contacts. 

10 

O In an embodiment, the binding site of the mannosidase II inhibitor swainsonine is described in 

tjj Table 3, and details of the atomic interactions of the binding site are set out in Table 4. In the 

:^ swainsonine binding site there are direct hydrogen bonds between the inhibitor and the 

HU enzyme. Atomic contacts on the enzyme comprise Trp-95, Phe-206, Tyr-727, Asp-472, Asp 

m 15 204 (see Table 4, Figures 1 and 5). 

si 

In a particular embodiment of the invention, a secondary or three-dimensional structure of a 
1^ binding domain of a mannosidase II that associates with an inhibitor of a mannosidase II is 

W provided comprising at least two or three atomic contacts of the atomic interactions in Table 

20 4, each atomic interaction defined therein by an atomic contact (more preferably, a specific 

atom where indicated) on the inhibitor, and an atomic contact (more preferably, a specific 

amino acid residue where indicated) on the mannosidase II (i.e. enzyme atomic contact). 

Preferably, the binding domain is defined by the atoms of the enzyme atomic contacts having 

the structural coordinates for the atoms listed in Table 1, 2, or 8. 

25 

METHOD OF MAKING A CRYSTAL 
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The present invention also provides a method of making a crystal according to the invention. 
The crystal may be formed from an aqueous solution comprising a purified polypeptide 
comprising a mannosidase II or part or fragment thereof (e.g. a catalytic portion, ligand 
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binding domain). A method may utilize a purified polypeptide comprising a mannosidase II 
ligand binding domain to form a crystal 

The term "purified" in reference to a polypeptide, does not require absolute purity such as a 
5 homogenous preparation rather it represents an indication that the polypeptide is relatively 
purer than in the natural environment. Generally, a purified polypeptide is substantially free of 
other proteins, lipids, carbohydrates, or other materials with which it is naturally associated, 
preferably at a functionally significant level for example at least 85% pure, more preferably at 
least 95% pure, most preferably at least 99% pure. A skilled artisan can purify a polypeptide 
10 comprising a mannosidase II using standard techniques for protein purification. A 
substantially pure polypeptide comprising a mannosidase II will yield a single major band on 
a non-reducing polyacrylamide gel. The purity of the mannosidase II can also be determined 
by amino-terminal amino acid sequence analysis. 

15 A polypeptide used in the method may be chemically synthesized in whole or in part using 
techniques that are well-known in the art. Alternatively, methods are well known to the 
skilled artisan to construct expression vectors containing the native or mutated mannosidase II 
coding sequence and appropriate transcriptional/translational control signals. These methods 
include in vitro recombinant DNA techniques, synthetic techniques, and in vivo 

20 recombination/genetic recombination. See for example the techniques described in Sambrook 
et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory 
press (1989)), and other laboratory textbooks. (See also Sarker et al, Glycoconjugate J. 7:380, 
1990; Sarker et al, Proc. Natl. Acad, Sci. USA 88:234-238, 1991, Sarker et al, Glycoconjugate 
J. 1 1: 204-209, 1994; Hull et al, Biochem Biophys Res Commun 176:608, 1991 and Pownall 

25 et al, Genomics 12:699-704, 1992). 

Crystals may be grown from an aqueous solution containing the purified mannosidase II 
polypeptide by a variety of conventional processes. These processes include batch, liquid, 
bridge, dialysis, vapor diffusion, and hanging drop methods. (See for example, McPherson, 
30 1982 John Wiley, New York; McPherson, 1990, Eur. J. Biochem. 189: 1-23; Webber. 1991, 
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Adv. Protein Chem. 41:1-36). Generally, the native crystals of the invention are grown by 
adding precipitants to the concentrated solution of the mannosidase II polypeptide. The 
precipitants are added at a concentration just below that necessary to precipitate the protein. 
Water is removed by controlled evaporation to produce precipitating conditions, which are 
maintained until crystal growth ceases. 

Derivative crystals of the invention can be obtained by soaking native crystals in a solution 
containing salts of heavy metal atoms. A complex of the invention can be obtained by soaking 
a native crystal in a solution containing a compound that binds the polypeptide, or they can be 
obtained by co-crystallizing the polypeptide in the presence of one or more compounds. In 
order to obtain co-crystals with a compound which binds deep within the tertiary structure of 
the polypeptide it is necessary to use the second method. 

Once the crystal is grown it can be placed in a glass capillary tube and mounted onto a 
holding device connected to an X-ray generator and an X-ray detection device. Collection of 
X-ray diffraction patterns are well documented by those skilled in the art (See for example, 
Ducruix and Geige, 1992, IRL Press, Oxford, England). A beam of X-rays enter the crystal 
and diffract from the crystal. An X-ray detection device can be utilized to record the 
diffraction patterns emanating from the crystal. Suitable devices include the Marr 345 
imaging plate detector system with an RU200 rotating anode generator. 

Methods for obtaining the three dimensional structure of the crystalline form of a molecule or 
complex are described herein and known to those skilled in the art (see Ducruix and Geige 
1992, IRL Press, Oxford, England). Generally, the x-ray crystal structure is given by the 
diffraction patterns. Each diffraction pattern reflection is characterized as a vector and the 
data collected at this stage determines the amplitude of each vector. The phases of the vectors 
may be determined by the isomorphous replacement method where heavy atoms soaked into 
the crystal are used as reference points in the X-ray analysis (see for example, Otwinowski, 
1991, Daresbury, United Kingdom, 80-86). The phases of the vectors may also be determined 
by molecular replacement (see for example, Naraza, 1994, Proteins 11:281-296). The 
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amplitudes and phases of vectors from the crystalline form of a mannosidase II determined in 
accordance with these methods can be used to analyze other related crystalline polypeptides. 



The unit cell dimensions and symmetry, and vector amplitude and phase information can be 
5 used in a Fourier transform function to calculate the electron density in the unit cell i.e. to 
generate an experimental electron density map. This may be accomplished using the PHASES 
package (Furey, 1990). Amino acid sequence structures are fit to the experimental electron 
density map (i.e. model building) using computer programs (e.g. Jones, TA. et al, Acta 
Crystallogr A47, 100-119, 1991). This structure can also be used to calculate a theoretical 

10 electron density map. The theoretical and experimental electron density maps can be 
compared and the agreement between the maps can be described by a parameter referred to as 
R-factor. A high degree of overlap in the maps is represented by a low value R-factor. The R- 
factor can be minimized by using computer programs that refine the structure to achieve 
agreement between the theoretical and observed electron density map. For example, the 

15 XPLOR program, developed by Brunger (1992, Nature 355:472-475) can be used for model 
refinement. 

A three dimensional structure of a molecule or complex may be described by atoms that fit 
the theoretical electron density characterized by a minimum R value. Files can be created for 
20 the structure that defines each atom by coordinates in three dimensions. 

MODEL 

A crystal structure of the present invention may be used to make a model of the mannosidase 
25 II or a part thereof, (e.g.a ligand-binding domain). A model may, for example, be a structural 
model (or a representation thereof), or a computer model. A model may represent the 
secondary, tertiary and/or quaternary structure of the mannosidase II. The model itself may 
be in two or three dimensions. It is possible for a computer model to be in three dimensions 
despite the constraints imposed by a conventional computer screen, if it is possible to scroll 
30 along at least a pair of axes, causing "rotation" of the image. 
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Thus, for example, the structural coordinates provided in the crystal structure and/or model 
structure may comprise the amino acid residues of the mannosidase II LBD, or a portion of 
the mannosidase II LBD or a homologue thereof useful in the modelling and design of test 
5 compounds capable of binding to the mannosidase II LBD. 

As used herein, the term "modelling" includes the quantitative and qualitative analysis of 
molecular structure and/or function based on atomic structural information and interaction 
models. The term "modelling" includes conventional numeric-based molecular dynamic and 
10 energy minimization models, interactive computer graphic models, modified molecular 
mechanics models, distance geometry and other structure-based constraint models. 

Preferably, modelling is performed using a computer and may be further optimized using 
known methods. This is called modelling optimisation. 

15 

Overlays and super positioning with a three dimensional model of the mannosidase II LBD, 
and/or a portion thereof, can also be used for modelling optimisation. Additionally, alignment 
and/or modelling can be used as a guide for the placement of mutations on the mannosidase II 
LBD surface to characterise the nature of the site in the context of a cell. 

20 

The three dimensional structure of a new crystal may be modelled using molecular 
replacement. The term "molecular replacement" refers to a method that involves generating a 
preliminary model of a molecule or complex whose structural coordinates are unknown, by 
orienting and positioning a molecule whose structural coordinates are known within the unit 

25 cell of the unknown crystal, so as best to account for the observed diffraction pattern of the 
unknown crystal. Phases can then be calculated from this model and combined with the 
observed amplitudes to give an approximate Fourier synthesis of the structure whose 
coordinates are unknown. This, in turn, can be subject to any of the several forms of 
refinement to provide a final, accurate structure of the unknown crystal. Lattman, E., "Use of 

30 the Rotation and Translation Functions", in Methods in Enzymology, 115, pp. 55-77 (1985); 
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M. G. Rossmann, ed., "The Molecular Replacement Method", Int. Sci. Rev. Ser., No. 13, 
Gordon & Breach, New York, (1972). 

Commonly used computer software packages for molecular replacement are X-PLOR 
5 (Brunger 1992, Nature 355: 472-475), AMoRE (Navaza, 1994, Acta Crystallogr. A50:157- 
163), the CCP4 package (Collaborative Computational Project, Number 4, "The CCP4 Suite: 
Programs for Protein Crystallography", Acta Cryst., Vol. D50, pp. 760-763, 1994), the 
MERLOT package (P.M.D. Fitzgerald, J. Appl. Cryst., Vol. 21, pp. 273-278, 1988) and 
XTALVIEW (McCree et al (1992) J. Mol. Graphics 10: 44-46. It is preferable that the 
10 resulting structure not exhibit a root-mean-square deviation of more than 3 A. 

The quality of the model may be analysed using a program such as PROCHECK or 3D- 
Profiler [Laskowski et al 1993 J. Appl. Cryst. 26:283-291; Luthy R. et al, Nature 356: 83-85, 
1992; and Bowie, J.U. et al, Science 253: 164-170, 1991]. Once any irregularities have been 
15 resolved, the entire structure may be further refined. 

Other molecular modelling techniques may also be employed in accordance with this 
invention. See, e.g., Cohen, N. C. et al, "Molecular Modelling Software and Methods for 
Medicinal Chemistry", J. Med. Chem., 33, pp. 883-894 (1990), See also, Navia, M A. and M. 
20 A. Murcko, "The Use of Structural Information in Drug Design", Current Opinions in 
Structural Biology, 2, pp. 202-210 (1992). 

Using the structural coordinates of the crystal complexes provided by this invention, 
molecular modelling may be used to determine the structure coordinates of a crystalline 
25 mutant or homologue of mannosidase II LBD or of a related protein. By the same token, a 
crystal of the second aspect of the invention can be used to provide a model of swainsonine. 
Modelling techniques can then be used to approximate the three dimensional structure of 
swainsonine derivatives and other compounds which may be able to mimic the atomic 
contacts between swainsonine and the LBD. 
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COMPUTER FORMAT OF CRYSTALS/MODELS 

Information derivable from the crystal of the present invention (for example the structural 
5 coordinates) and/or a model of the present invention may be provided in a computer-readable 
format. 

Therefore, the invention provides a computer readable medium or a machine readable storage 
medium which comprises the structural coordinates of a mannosidase II including all or any 
10 parts of the mannosidase II (e.g ligand-binding domain), ligands including portions thereof, or 

O substrates including portions thereof. Such storage medium or storage medium encoded with 

, 

h these data are capable of displaying on a computer screen or similar viewing device, a three- 

j:L: dimensional graphical representation of a molecule or molecular complex which comprises 

IU the enzyme or ligand binding domains or similarly shaped homologous enzymes or ligand 

15 binding domains. Thus, the invention also provides computerized representations of a crystal 
of the invention, including any electronic, magnetic, or electromagnetic storage forms of the 

US data needed to define the structures such that the data will be computer readable for purposes 

12 of display and/or manipulation. 

20 In an aspect the invention provides a computer for producing a three-dimensional 
representation of a molecule or molecular complex, wherein said molecule or molecular 
complex comprises a mannosidase II or ligand binding domain thereof defined by structural 
coordinates of mannosidase II amino acids or a ligand binding domain thereof, or comprises 
structural coordinates of atoms of a ligand or substrate, or a three-dimensional representation 
25 of a homologue of said molecule or molecular complex, wherein said computer comprises: 

(a) a machine-readable data storage medium comprising a data storage material encoded 
with machine readable data wherein said data comprises the structural coordinates of a 
mannosidase II amino acids according to Table 1, 2, or 8 or a ligand binding domain 
thereof, or a ligand (e.g. swainsonine) according to Table 2, or Table 8; 
30 (b) a working memory for storing instructions for processing said machine-readable data; 
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(c) a central -processing unit coupled to said working memory and to said machine- 
readable data storage medium for processing said machine readable data into said 
three-dimensional representation; and 

(d) a display coupled to said central-processing unit for displaying said three-dimensional 
5 representation. 

A homologue may comprise a mannosidase II or ligand binding domain thereof, or ligand or 
substrate that has a root mean square deviation from the backbone atoms of not more than 1.5 
angstroms. 

10 

The invention also provides a computer for determining at least a portion of the structural 
coordinates corresponding to an X-ray diffraction pattern of a molecule or molecular complex 
wherein said computer comprises: 

(a) a machine-readable data storage medium comprising a data storage material 
15 encoded with machine readable data wherein said data comprises the structural 

coordinates according to Table 1, 2, or 8; 

(b) a machine-readable data storage medium comprising a data storage material 
encoded with machine readable data wherein said data comprises an X-ray 
diffraction pattern of said molecule or molecular complex; 

20 (c) a working memory for storing instructions for processing said machine-readable 

data of (a) and (b); 

(d) a central-processing unit coupled to said working memory and to said machine- 
readable data storage medium of (a) and (b) for performing a Fourier transform of 
the machine readable data of (a) and for processing said machine readable data of 

25 (b) into structural coordinates; and 

(e) a display coupled to said central-processing unit for displaying said structural 
coordinates of said molecule or molecular complex. 
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STRUCTURAL DETERMINATIONS 



The present invention also provides a method for determining the secondary and/or tertiary 
structures of a polypeptide by using a crystal, or a model according to the present invention. 
5 The polypeptide may be any polypeptide for which the secondary and or tertiary structure is 
uncharacterised or incompletely characterised. In a preferred embodiment the polypeptide 
shares (or is predicted to share) some structural or functional homology to the mannosidase II 
crystal. For example, the polypeptide may show a degree of structural homology over some 
or all parts of the primary amino acid sequence. For example the polypeptide may have one or 
10 more domains which shows homology with a mannosidase II domain (Kapitonov and Yu 
O (1999) Glycobiology 9(10): 961-978). 

j. 

^ The polypeptide may be a mannosidase II with a different specificity for a ligand or substrate. 

fU The polypeptide may be a mannosidase II which requires a different metal cofactor. 

15 Alternatively (or in addition) the polypeptide may be a mannosidase II from a different 
S! species. 

:~ The polypeptide may be a mutant of the wild-type mannosidase II. A mutant may arise 

O naturally, or may be made artificially (for example using molecular biology techniques). The 

20 mutant may also not be "made" at all in the conventional sense, but merely tested 

theoretically using the model of the present invention. A mutant may or may not be 

functional. 

Thus, using the model of the present invention, the effect of a particular mutation on the 
25 overall two and/or three dimensional structure of a mannosidase II and/or the interaction 
between the enzyme and a ligand or substrate can be investigated. Alternatively, the 
polypeptide may perform an analogous function or be suspected to show a similar catalytic 
mechanism to the mannosidase II enzyme. For example the polypeptide may remove, 
transport, or add on a sugar residue. 

30 
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The polypeptide may also be the same as the polypeptide of the crystal, but in association 
with a different ligand (for example, modulator or inhibitor) or cofactor. In this way it is 
possible to investigate the effect of altering a ligand or compound with which the polypeptide 
is associated on the structure of the LBD. 

5 

Secondary or tertiary structure may be determined by applying the structural coordinates of 
the crystal or model of the present invention to other data such as an amino acid sequence, X- 
ray crystallographic diffraction data, or nuclear magnetic resonance (NMR) data. Homology 
modeling, molecular replacement, and nuclear magnetic resonance methods using these other 
10 data sets are described below. 

Homology modeling (also known as comparative modeling or knowledge-based modeling) 
methods develop a three dimensional model from a polypeptide sequence based on the 
structures of known proteins (i.e. mannosidase II of the crystal). The method utilizes a 

15 computer model of the crystal of the present invention (the "known structure"), a computer 
representation of the amino acid sequence of the polypeptide with an unknown structure, and 
standard computer representations of the structures of amino acids. The method in particular 
comprises the steps of; (a) identifying structurally conserved and variable regions in the 
known structure; (b) aligning the amino acid sequences of the known structure and unknown 

20 structure (c) generating coordinates of main chain atoms and side chain atoms in structurally 
conserved and variable regions of the unknown structure based on the coordinates of the 
known structure thereby obtaining a homology model; and (d) refining the homology model 
to obtain a three dimensional structure for the unknown structure. This method is well known 
to those skilled in the art (Greer, 1985, Science 228, 1055; Bundell et al 1988, Eur. J. 

25 Biochem. 172, 513; Knighton et aL, 1992, Science 258:130-135, 
http://biochem.vt.edu/courses/modeling/homology.htn). Computer programs that can be used 
in homology modeling are Quanta and the Homology module in the Insight II modelling 
package distributed by Molecular Simulations Inc, or MODELLER (Rockefeller University, 
www.iucr.ac.uk/sinris-top/logical/prg-modeller.html). 

30 
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In step (a) of the homology modeling method, the known mannosidase II structure is 
examined to identify the structurally conserved regions (SCRs) from which an average 
structure, or framework, can be constructed for these regions of the protein. Variable regions 
(VRs), in which known structures may differ in conformation, also must be identified. SCRs 
5 generally correspond to the elements of secondary structure, such as alpha-helices and beta- 
sheets, and to ligand- and substrate-binding sites (e.g. acceptor and donor binding sites). The 
VRs usually lie on the surface of the proteins and form the loops where the main chain turns. 

Many methods are available for sequence alignment of known structures and unknown 

10 structures. Sequence alignments generally are based on the dynamic programming algorithm 

p of Needleman and Wunsch [J, Mol. Biol. 48: 442-453, 1970]. Current methods include 

^ FASTA, Smith-Waterman, and BLASTP, with the BLASTP method differing from the other 

^ two in not allowing gaps. Scoring of alignments typically involves construction of a 20x20 

ftp matrix in which identical amino acids and those of similar character (i.e., conservative 

J ; J 15 substitutions) may be scored higher than those of different character. Substitution schemes 

8i which may be used to score alignments include the scoring matrices PAM (Dayhoff et al., 

3 Meth. Enzymol. 91: 524-545, 1983), and BLOSUM (Henikoff and Henikoff, Proc. Nat. Acad, 

jj Sci. USA 89: 10915- 0919, 1992), and the matrices based on alignments derived from three- 

□ dimensional structures including that of Johnson and Overington (JO matrices) (J. Mol. Biol. 

*~ 20 233:716-738, 1993). 

Alignment based solely on sequence may be used; however, other structural features also may 
be taken into account. In Quanta, multiple sequence alignment algorithms are available that 
may be used when aligning a sequence of the unknown with the known structures. Four 
25 scoring systems (i.e. sequence homology, secondary structure homology, residue accessibility 
homology, CA-CA distance homology) are available, each of which may be evaluated during 
an alignment so that relative statistical weights may be assigned. 
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When generating coordinates for the unknown structure, main chain atoms and side chain 
atoms, both in SCRs and VRs need to be modeled. A variety of approaches known to those 
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skilled in the art may be used to assign coordinates to the unknown. In particular, the 
coordinates of the main chain atoms of SCRs will be transferred to the unknown structure. 
VRs correspond most often to the loops on the surface of the polypeptide and if a loop in the 
known structure is a good model for the unknown, then the main chain coordinates of the 
5 known structure may be copied. Side chain coordinates of SCRs and VRs are copied if the 
residue type in the unknown is identical to or very similar to that in the known structure. For 
other side chain coordinates, a side chain rotamer library may be used to define the side chain 
coordinates. When a good model for a loop cannot be found fragment databases may be 
searched for loops in other proteins that may provide a suitable model for the unknown. If 
10 desired, the loop may then be subjected to conformational searching to identify low energy 
Q confbrmers if desired. 

Once a homology model has been generated it is analyzed to determine its correctness. A 
K computer program available to assist in this analysis is the Protein Health module in Quanta 

ffl 15 which provides a variety of tests. Other programs that provide structure analysis along with 
^ output include PROCHECK and 3D-Profiler [Luthy R. et al, Nature 356: 83-85, 1992; and 

y} Bowie, J.U. et al, Science 253: 164-170, 1991]. Once any irregularities have been resolved, 

J X the entire structure may be further refined. Refinement may consist of energy minimization 

Q with restraints, especially for the SCRs. Restraints may be gradually removed for subsequent 

20 minimizations. Molecular dynamics may also be applied in conjunction with energy 

minimization. 

Molecular replacement involves applying a known structure to solve the X-ray 
crystallographic data set of a polypeptide of unknown structure. The method can be used to 

25 define the phases describing the X-ray diffraction data of a polypeptide of unknown structure 
when only the amplitudes are known. Thus in an embodiment of the invention, a method is 
provided for determining three dimensional structures of polypeptides with unknown structure 
by applying the structural coordinates of the crystal of the present invention to provide an X- 
ray crystallographic data set for a polypeptide of unknown structure, and (b) determining a 

30 low energy conformation of the resulting structure. 
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Molecular replacement computer programs generally involve the following steps: (1) 
determining the number of molecules in the unit cell and defining the angles between them 
(self rotation function); (2) rotating the known structure against diffraction data to define the 
5 orientation of the molecules in the unit cell (rotation function); (3) translating the known 
structure in three dimensions to correctly position the molecules in the unit cell (translation 
function); (4) determining the phases of the X-ray diffraction data and calculating an R-factor 
calculated from the reference data set and from the new data wherein an R-factor between 30- 
50% indicates that the orientations of the atoms in the unit cell have been reasonably 
10 determined by the method; and (5) optionally, decreasing the R-factor to about 20% by 
refining the new electron density map using iterative refinement techniques known to those 
skilled in the art (refinement). 

In an embodiment of the invention, a method is provided for determining three dimensional 
15 structures of polypeptides with unknown structure (e.g. additional native or mutated 
mannosidase II enzymes) by applying the structural coordinates of a mannosidase II structure 
to provide an X-ray crystallographic data set for a polypeptide of unknown structure, and (b) 
determining a low energy conformation of the resulting structure. 

20 The structural coordinates of the crystal of the present invention may be applied to nuclear 
magnetic resonance (NMR) data to determine the three dimensional structures of polypeptides 
with uncharacterised or incompletely characterised sturcture. (See for example, Wuthrich, 
1986, John Wiley and Sons, New York: 176-199; Pflugrath et al., 1986, J. Molecular Biology 
189: 383-386; Kline et al., 1986 J. Molecular Biology 189:377-382). While the secondary 

25 structure of a polypeptide may often be determined by NMR data, the spatial connections 
between individual pieces of secondary structure are not as readily determined. The structural 
coordinates of a polypeptide defined by X-ray crystallography can guide the NMR 
spectroscopist to an understanding of the spatial interactions between secondary structural 
elements in a polypeptide of related structure. Information on spatial interactions between 

30 secondary structural elements can greatly simplify Nuclear Overhauser Effect (NOE) data 
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from two-dimensional NMR experiments. In addition, applying the structural coordinates 
after the determination of secondary structure by NMR techniques simplifies the assignment 
of NOE's relating to particular amino acids in the polypeptide sequence and does not greatly 
bias the NMR analysis of polypeptide structure. 

5 

In an embodiment, the invention relates to a method of determining three dimensional 
structures of polypeptides with unknown structures, by applying the structural coordinates of 
a crystal of the present invention to nuclear magnetic resonance (NMR) data of the unknown 
structure. This method comprises the steps of: (a) determining the secondary structure of an 

10 unknown structure using NMR data; and (b) simplifying the assignment of through-space 
interactions of amino acids. The term " through-space interactions" defines the orientation of 
the secondary structural elements in the three dimensional structure and the distances between 
amino acids from different portions of the amino acid sequence. The term "assignment" 
defines a method of analyzing NMR data and identifying which amino acids give rise to 

15 signals in the NMR spectrum, 

SCREENING METHOD 

The present invention also provides a method of screening for a ligand that associates with a 
20 ligand binding domain and/or modulates the function of mannosidase II, by using a crystal or 
a model according to the present invention. The method may involve investigating whether a 
test compound is capable of associating with or binding a ligand binding domain. 

In accordance with an aspect of the present invention, a method is provided for screening for 
25 a ligand capable of binding to a ligand binding domain, wherein said method comprises the 
use of a crystal or model according to the invention. 
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In another aspect, the invention relates to a method of screening for a ligand capable of 
binding to a ligand binding domain, wherein the ligand binding domain is defined by the 
amino acid residue structural coordinates given herein, the method comprising contacting the 
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ligand binding domain with a test compound and determining if said test compound binds to 
said ligand binding domain. 

In one embodiment, the present invention provides a method of screening for a test compound 
capable of interacting with a key amino acid residue of the ligand binding domain of 
mannosidase II. 

Another aspect of the invention provides a process comprising the steps of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to a ligand binding domain; and 

(c) preparing a quantity of said one or more ligands. 

A further aspect of the invention provides a process comprising the steps of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to a ligand binding domain; and 

(c) preparing a pharmaceutical composition comprising said one or more ligands. 

Once a test compound capable of interacting with a key amino acid residue in a mannosidase 
II LBD has been identified, further steps may be carried out either to select and/or to modify 
compounds and/or to modify existing compounds, to modulate the interaction with the key 
amino acid residues in the mannosidase II LBD. 

Yet another aspect of the invention provides a process comprising the steps of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to a ligand binding domain; 

(c) modifying said one or more ligands capable of binding to a ligand binding 
domain; 

(d) performing said method of screening for a ligand as described above; 

(e) optionally preparing a pharmaceutical composition comprising said one or 
more ligands. 
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As used herein, the term "test compound" means any compound which is potentially capable 
of associating with a ligand binding domain. If, after testing, it is determined that the test 
compound does bind to the LBD, it is known as a "ligand". 

5 

A "test compound" includes, but is not limited to, a compound which may be obtainable from 
or produced by any suitable source, whether natural or not. The test compound may be 
designed or obtained from a library of compounds which may comprise peptides, as well as 
other compounds, such as small organic molecules and particularly new lead compounds. By 

10 way of example, the test compound may be a natural substance, a biological macromolecule, 
or an extract made from biological materials such as bacteria, fungi, or animal (particularly 
mammalian) cells or tissues, an organic or an inorganic molecule, a synthetic test compound, 
a semi-synthetic test compound, a carbohydrate, a monosaccharide, an oligosaccharide or 
polysaccharide, a glycolipid, a glycopeptide, a saponin, a heterocyclic compound, a structural 

15 or functional mimetic, a peptide, a peptidomimetic, a derivatised test compound, a peptide 
cleaved from a whole protein, or a peptides synthesised synthetically (such as, by way of 
example, either using a peptide synthesizer or by recombinant techniques or combinations 
thereof), a recombinant test compound, a natural or a non-natural test compound, a fusion 
protein or equivalent thereof and mutants, derivatives or combinations thereof. 

20 

The test compound may be screened as part of a library or a data base of molecules. Data 
bases which may be used include ACD (Molecular Designs Limited), NCI (National Cancer 
Institute), CCDC (Cambridge Crystallographic Data Center), CAST (Chemical Abstract 
Service), Derwent (Derwent Information Limited), Maybridge (Maybridge Chemical 
25 Company Ltd), Aldrich (Aldrich Chemical Company), DOCK (University of California in 
San Francisco), and the Directory of Natural Products (Chapman & Hall). Computer 
programs such as CONCORD (Tripos Associates) or DB-Converter (Molecular Simulations 
Limited) can be used to convert a data set represented in two dimensions to one represented in 
three dimensions. 
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Test compounds may be tested for their capacity to fit spatially into a mannosidsase II LBD. 
As used herein, the term "fits spatially" means that the three-dimensional structure of the test 
compound is accommodated geometrically in a cavity or pocket of the mannosidase II LBD. 
The test compound can then be considered to be a ligand. 

5 

A favourable geometric fit occurs when the surface areas of the test compound is in close 
proximity with the surface area of the cavity or pocket without forming unfavorable 
interactions. A favourable complementary interaction occurs where the test compound 
interacts by hydrophobic, aromatic, ionic, dipolar, or hydrogen donating and accepting forces. 
10 Unfavourable interactions may be steric hindrance between atoms in the test compound and 
Q atoms in the binding site. 

j£ If a model of the present invention is a computer model, the test compounds may be 

fit positioned in an LBD through computational docking. If, on the other hand, the model of the 

15 present invention is a structural model, the test compounds may be positioned in the LBD by, 
for example, manual docking. 

!^ As used herein the term "docking" refers to a process of placing a compound in close 

O proximity with a mannosidase II LBD, or a process of finding low energy conformations of a 

20 test compound/glycosyltransferase complex. 

A screening method of the present invention may comprise the following steps: 

(i) generating a computer model of a mannosidase II or a selected site thereof 
using a crystal according to the first aspect of the invention; 
25 (ii) docking a computer representation of a test compound with the computer 

model; 

(iii) analysing the fit of the compound in the mannosidase II or selected site. 



In an aspect of the invention a method is provided comprising the following steps: 
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(a) docking a computer representation of a structure of a test compound into a 
computer representation of a binding domain of a mannosidase II defined in 
accordance with the invention using a computer program, or by interactively 
moving the representation of the test compound into the representation of the 
binding domain; 

(b) characterizing the geometry and the complementary interactions formed between 
the atoms of the binding domain and the compound; optionally 

(c) searching libraries for molecular fragments which can fit into the empty space 
between the compound and binding domain and can be linked to the compound; 
and 

(d) linking the fragments found in (c) to the compound and evaluating the new 
modified compound. 

In an embodiment of the invention a method is provided which comprises the following steps: 

(a) docking a computer representation of a test compound from a computer data 
base with a computer representation of a selected site (e.g. the inhibitor 
binding domain) on a mannosidase II structure defined in accordance with the 
invention to obtain a complex; 

(b) determining a conformation of the complex with a favourable geometric fit and 
favourable complementary interactions; and 

(c) identifying test compounds that best fit the selected site as potential modulators 
of the mannosidase II. 

A method of the invention may be applied to a plurality of test compounds, to identify those 
that best fit the selected site. 

The model used in the screening method may comprise the ligand-binding domain of a 
mannosidase II enzyme either alone or in association with one or more ligands and/or 
cofactors. For example, the model may comprise the ligand-binding domain in association 
with a substrate or analogue thereof 
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If the model comprises an unassociated ligand binding domain, then the selected site under 
investigation may be the LBD itself. The test compound may, for example, mimic a known 
substrate for the enzyme in order to interact with the LBD. The selected site may 
5 alternatively be another site on the enzyme. 

If the model comprises an associated LBD, for example an LBD in association with a 
substrate molecule or analogue thereof, the selected site may be the LBD or a site made up of 
the LBD and the complexed ligand, or a site on the ligand itself. The test compound may be 
10 investigated for its capacity to modulate the interaction with the associated molecule. 

A test compound (or plurality of test compounds) may be selected on the basis of its similarity 
to a known ligand for the mannosidase II. For example, the screening method may comprise 
the following steps: 

15 (i) generating a computer model of the LBD of a mannosidase II in complex with 

a ligand; 

(ii) searching for a test compound with a similar three dimensional structure and/or 
similar chemical groups; and 

(iii) evaluating the fit of the test compound in the LBD. 

20 

Searching may be carried out using a database of computer representations of potential 
compounds, using methods known in the art. 

The present invention also provides a method for designing ligands for a mannosidase II. It is 
25 well known in the art to use a screening method as described above to identify a test 
compound with promising fit, but then to use this test compound as a starting point to design a 
ligand with improved fit to the model. A known modulator can also be modified to enhance 
its fit with a model of the invention. Such techniques are known as "structure-based ligand 
design" (See Kuntz et al., 1994, Acc. Chem. Res. 27:117; Guida, 1994, Current Opinion in 
30 Struc. Biol. 4: 777; and Colman, 1994, Current Opinion in Struc. Biol. 4: 868, for reviews of 
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structure-based drug design and identification;and Kuntz et al 1982, J. Mol. Biol. 162:269; 
Kuntz et al., 1994, Acc. Chem. Res. 27: 117; Meng et al., 1992, J. Compt. Chem. 13: 505; 
Bohm, 1994, J. Comp. Aided Molec. Design 8: 623 for methods of structure-based modulator 
design). 

5 

Examples of computer programs that may be used for structure-based ligand design are 
CAVEAT (Bartlett et al., 1989, in "Chemical and Biological Problems in Molecular 
Recognition", Roberts, S.M. Ley, S.V.; Campbell, N.M. eds; Royal Society of Chemistry: 
Cambridge, pp 182-196); FLOG (Miller et al., 1994, J. Comp. Aided Molec. Design 8:153); 
10 PRO Modulator (Clark et al., 1995 J. Comp. Aided Molec. Design 9:13); MCSS (Miranker 
and Karplus, 1991, Proteins: Structure, Function, and Genetics 8:195); and, GRID (Goodford, 
1985, J. Med. Chem. 28:849). 

The method may comprise the following steps: 
15 (i) docking a model of a test compound with a model of a selected site; 

(ii) identifying one or more groups on the test compound which may be modified 
to improve their fit in the selected site; 

(iii) replacing one or more identified groups to produce a modified test compound 
model; and 

20 (iv) docking the modified test compound model with the model of the selected site. 

Evaluation of fit may comprise the following steps: 

(a) mapping chemical features of a test compound such as by hydrogen bond donors or 
acceptors, hydrophobic/lipophilic sites, positively ionizable sites, or negatively 

25 ionizable sites; and 

(b) adding geometric constraints to selected mapped features. 



The fit of the modified test compound may then be evaluated using the same criteria. 
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The chemical modification of a group may either enhance or reduce hydrogen bonding 
interaction, charge interaction, hydrophobic interaction, Van Der Waals interaction or dipole 
interaction between the test compound and the key amino acid residue(s) of the selected site. 
Preferably the group modifications involve the addition, removal, or replacement of 
5 substituents onto the test compound such that the substituents are positioned to collide or to 
bind preferentially with one or more amino acid residues that correspond to the key amino 
acid residues of the selected site. 

Identified groups in a test compound may be substituted with, for example, alkyl, alkoxy, 
10 hydroxyl, aryl, cycloalkyl, alkenyl, alkynyl, thiol, thioalkyl, thioaryl, amino, or halo groups. 
Generally, initial substitutions are conservative, i.e., the replacement group will have 
approximately the same size, shape, hydrophobicity and charge as the original group. It 
should, of course, be understood that components known in the art to alter conformation 
should be avoided. 

15 

If a modified test compound model has an improved fit, then it may bind to the selected site 
and be considered to be a "ligand". Rational modification of groups may be made with the 
aid of libraries of molecular fragments which may be screened for their capacity to fit into the 
available space and to interact with the appropriate atoms. Databases of computer 
20 representations of libraries of chemical groups are available commercially, for this purpose. 

A test compound may also be modified "w situ" (i.e. once docked into the potential binding 
site), enabling immediate evaluation of the effect of replacing selected groups. The computer 
representation of the test compound may be modified by deleting a chemical group or groups, 

25 replacing chemical groups, or by adding a chemical group or groups. After each modification 
to a compound, the atoms of the modified compound and potential binding site can be shifted 
in conformation and the distance between the modulator and the active site atoms may be 
scored on the basis of geometric fit and favourable complementary interactions between the 
molecules. This technique is described in detail in Molecular Simulations User Manual, 1995 

30 in LUDI. 
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Examples of ligand building and/or searching computer include programs in the Molecular 
Simulations Package (Catalyst), ISIS/HOST, ISIS/BASE, and ISIS/DRAW (Molecular 
Designs Limited), and UNITY (Tripos Associates). 

5 

The "starting point" for rational ligand design may be a known ligand for the enzyme. For 
example, in order to identify potential modulators of the mannosidase II, a logical approach 
would be to start with a known ligand (for example a substrate molecule or inhibitor ) to 
produce a molecule which mimics the binding of the ligand. Such a molecule may, for 
10 example, act as a competitive inhibitor for the true ligand, or may bind so strongly that the 
interaction (and inhibition) is effectively irreversible. 

Such a method may comprise the following steps: 

(i) generating a computer model of a LBD of a mannosidase II in complex with a 
15 ligand; 

(ii) replacing one or more groups on the ligand model to produce a modified 
ligand; and 

(iii) evaluating the fit of the modified ligand in the LBD. 

20 The replacement groups could be selected and replaced using a compound construction 
program which replaces computer representations of chemical groups with groups from a 
computer database, where the representations of the compounds are defined by structural 
coordinates. 



25 In an embodiment, a screening method is provided for identifying a ligand of a mannosidase 
II comprising the step of using the structural coordinates of a substrate molecule or 
swainsonine or component thereof, defined in relation to its spatial association with a 
mannosidase II structure or a ligand binding domain of the invention, to generate a compound 
that is capable of associating with the mannosidase II or ligand binding domain. 
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In an embodiment of the invention, a screening method is provided for identifying a ligand of 
a mannosidase II comprising the step of using the structural coordinates of swainsonine listed 
in Table 2 or 8 to generate a compound for associating with a ligand binding domain of a 
mannosidase II as described herein. The following steps are employed in a particular method 
of the invention: (a) generating a computer representation of swainsonine, defined by its 
structural coordinates listed in Table 2 or 8; (b) searching for molecules in a data base that are 
structurally or chemically similar to the defined swainsonine, using a searching computer 
program, or replacing portions of the compound with similar chemical structures from a 
database using a compound building computer program. 

The screening methods of the present invention may be used to identify compounds or entities 
that associate with a molecule that associates with a mannosidase II enzyme (for example, a 
substrate molecule). 

Compounds and entities (e.g. ligands) of mannosidase II identified using the above-described 
methods may be prepared using methods described in standard reference sources utilized by 
those skilled in the art. For example, organic compounds may be prepared by organic 
synthetic methods described in references such as March, 1994, Advanced Organic 
Chemistry: Reactions, Mechanisms, and Structure, New York, McGraw Hill. 

Test compounds and ligands which are identified using a crystal or model of the present 
invention can be screened in assays such as those well known in the art. Screening can be, for 
example, in vitro, in cell culture, and/or in vivo. Biological screening assays preferably centre 
on activity-based response models, binding assays (which measure how well a compound 
binds to the receptor), and bacterial, yeast and animal cell lines (which measure the biological 
effect of a compound in a cell). The assays can be automated for high capacity-high 
throughput screening (HTS) in which large numbers of compounds can be tested to identify 
compounds with the desired activity. The biological assay, may also be an assay for the 
ligand binding activity of a compound that selectively binds to the LBD compared to other 
nuclear receptors. 
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LIGANDS/COMPOUNDS/MODULATORS 

The present invention provides a ligand or compound or entity identified by a screening 
5 method of the present invention. A ligand or compound may have been designed rationally 
by using a model according to the present invention. A ligand or compound identified using 
the screening methods of the invention specifically associate with a target compound. In the 
present invention the target compound may be the mannosidase II enzyme or a molecule that 
is capable of associating with the mannosidase II enzyme (for example a substrate molecule). 
10 In a preferred embodiment the ligand is capable of binding to the LBD of a mannosidase II. 

A ligand or compound identified using a screening method of the invention may act as a 
"modulator", i.e. a compound which affects the activity of a mannosidase II, A modulator 
may reduce, enhance or alter the biological function of a mannosidase II. For example a 
15 modulator may modulate the capacity of the enzyme to hydrolyse mannose residues. An 
alteration in biological function may be characterised by a change in specificity. For 
example, a modulator may cause the enzyme to accept a different substrate molecule, to 
transfer a different sugar, or to work with a different metal cofactor. In order to exert its 
function, the modulator commonly binds to the ligand binding domain. 

20 

A "modulator" which is capable of reducing the biological function of the enzyme may also 
be known as an inhibitor. Preferably an inhibitor reduces or blocks the capacity of the 
enzyme to hydrolyse mannose residues. The inhibitor may mimic the binding of a substrate 
molecule, for example, it may be a substrate analogue. A substrate analogue may be designed 

25 by considering the interactions between the substrate molecule and the enzyme (for example 
by using information derivable from the crystal of the invention) and specifically altering one 
or more groups (as described above). 

In a highly preferred embodiment a modulator acts as an inhibitor of the mannosidase 
II and is capable of inhibiting N-glycan biosynthesis. In another embodiment a modulator 

30 enhances mannosidase II activity and is capable of regulating the immune system. 
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The present invention also provides a method for modulating the activity of a mannosidase II 
within a cell using a modulator according to the present invention. It would be possible to 
monitor the expression of N-glycans on the cell surface following such treatment by a number 
5 of methods known in the art (for example by detecting expression with an N-glycan specific 
antibody). 

In another preferred embodiment, the modulator modulates the catalytic mechanism of the 
enzyme. 

10 

A modulator may be an agonist, partial agonist, partial inverse agonist or antagonist of the 
mannosidase II. 

As used herein, the term "agonist" means any ligand, which is capable of binding to a ligand 
15 binding domain and which is capable of increasing a proportion of the enzyme that is in an 
active form, resulting in an increased biological response. The term includes partial agonists 
and inverse agonists. 

As used herein, the term "partial agonist" means an agonist that is unable to evoke the 
20 maximal response of a biological system, even at a concentration sufficient to saturate the 
specific receptors. 

As used herein, the term "partial inverse agonist" is an inverse agonist that evokes a 
submaximal response to a biological system, even at a concentration sufficient to saturate the 
25 specific receptors. At high concentrations, it will diminish the actions of a full inverse 
agonist. 

The invention relates to a mannosidase II ligand binding domain antagonist, wherein said 
ligand binding domain is that defined by the amino acid structural coordinates described 
30 herein. For example the ligand may antagonise the inhibition of mannosidase by swainsonine. 
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As used herein, the term "antagonist" means any agent that reduces the action of another 
agent, such as an agonist. The antagonist may act at the same site as the agonist (competitive 
antagonism). The antagonistic action may result from a combination of the substance being 
5 antagonised (chemical antagonism) or the production of an opposite effect through a different 
receptor (functional antagonism or physiological antagonism) or as a consequence of 
competition for the binding site of an intermediate that links receptor activation to the effect 
observed (indirect antagonism). 

10 As used herein, the term "competitive antagonism" refers to the competition between an 
agonist and an antagonist for a receptor that occurs when the binding of agonist and 
antagonist becomes mutually exclusive. This may be because the agonist and antagonist 
compete for the same binding site or combine with adjacent but overlapping sites. A third 
possibility is that different sites are involved but that they influence the receptor 

15 macromolecules in such a way that agonist and antagonist molecules cannot be bound at the 
same time. If the agonist and antagonist form only short lived combinations with the receptor 
so that equilibrium between agonist, antagonist and receptor is reached during the presence of 
the agonist, the antagonism will be surmountable over a wide range of concentrations. In 
contrast, some antagonists, when in close enough proximity to their binding site, may form a 

20 stable covalent bond with it and the antagonism becomes insurmountable when no spare 
receptors remain. 

As mentioned above, an identified ligand or compound may act as a ligand model (for 
example, a template) for the development of other compounds. A modulator may be a 

25 mimetic of a ligand or ligand binding domain. A mimetic of a ligand may compete with a 
natural ligand for a mannosidase II and antogonize a physiological effect of the enzyme in an 
animal. A mimetic of a ligand may be an organically synthesized compound. A mimetic of a 
ligand binding domain, may be either a peptide or other biopharmaceutical (such as an 
organically synthesized compound) that specifically binds to a natural substrate molecule for 

30 a mannosidase II and antagonize a physiological effect of the enzyme in an animal. 
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A modulator may be one or a variety of different sorts of molecule. For example, a modulator 
may be a peptide, member of random peptide libraries and combinatorial chemistry-derived 
molecular libraries, phosphopeptide (including members of random or partially degenerate, 
directed phosphopeptide libraries), a carbohydrate, a monosaccharide, an oligosaccharide or 
polysaccharide, a glycolipid, a glycopeptide, a saponin, a heterocyclic compound antibody, 
carbohydrate, nucleoside or nucleotide or part thereof, and small organic or inorganic 
molecule. A modulator may be an endogenous physiological compound, or it may be a natural 
or synthetic compound. The modulators of the present invention may be natural or synthetic. 
The term "modulator" also refers to a chemically modified ligand or compound, and includes 
isomers and racemic forms. 

Once a ligand has been optimally selected or designed, substitutions may then be made in 
some of its atoms or side groups in order to improve or modify its binding properties. 
Generally, initial substitutions are conservative, i.e., the replacement group will have 
approximately the same size, shape, hydrophobicity and charge as the original group. It 
should, of course, be understood that components known in the art to alter conformation 
should be avoided. Such substituted chemical compounds may then be analyzed for efficiency 
of fit to the mannosidase II LBD by the same computer methods described above. 

Preferably, positions for substitution are selected based on the predicted binding orientation of 
a ligand to the mannosidase II LBD. 

A technique suitable for preparing a modulator will depend on its chemical nature. For 
example, organic compounds may be prepared by organic synthetic methods described in 
references such as March, 1994, Advanced Organic Chemistry: Reactions, Mechanisms, and 
Structure, New York, McGraw Hill. Peptides can be synthesized by solid phase techniques 
(Roberge JY et al (1995) Science 269: 202-204) and automated synthesis may be achieved, 
for example, using the ABI 43 1 A Peptide Synthesizer (Perkin Elmer) in accordance with the 
instructions provided by the manufacturer. Once cleaved from the resin, the peptide may be 
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purified by preparative high performance liquid chromatography (e.g., Creighton (1983) 
Proteins Structures and Molecular Principles, WH Freeman and Co, New York NY). The 
composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing 
(e.g., the Edman degradation procedure; Creighton, supra), 

5 

If a modulator is a nucleotide, or a polypeptide expressable therefrom, it may be synthesized, 
in whole or in part, using chemical methods well known in the art (see Caruthers MH et al 
(1980) Nuc Acids Res Symp Ser 215-23, Horn T et al (1980) Nuc Acids Res Symp Ser 225- 
232), or it may be prepared using recombinant techniques well known in the art. 

10 

Direct synthesis of a ligand or mimetics thereof can be performed using various solid-phase 
techniques (Roberge JY et al (1995) Science 269: 202-204) and automated synthesis may be 
achieved, for example, using the ABI 43 1 A Peptide Synthesizer (Perkin Elmer) in 
accordance with the instructions provided by the manufacturer. Additionally, the amino acid 
15 sequences obtainable from the ligand, or any part thereof, may be altered during direct 
synthesis and/or combined using chemical methods with a sequence from other subunits, or 
any part thereof, to produce a variant ligand. 

In an alternative embodiment of the invention, the coding sequence of a ligand or mimetics 
20 thereof may be synthesized, in whole or in part, using chemical methods well known in the art 
(see Caruthers MH et al (1980) Nuc Acids Res Symp Ser 215-23, Horn T et al (1980) Nuc 
Acids Res Symp Ser 225-232). 

A wide variety of host cells can be employed for expression of the nucleotide sequences 
25 encoding a ligand of the present invention. These cells may be both prokaryotic and 
eukaryotic host cells. Suitable host cells include bacteria such as £. coli, yeast, filamentous 
fungi, insect cells, mammalian cells, typically immortalized, e.g., mouse, CHO, human and 
monkey cell lines and derivatives thereof Preferred host cells are able to process the 
expression products to produce an appropriate mature polypeptide. Processing includes but is 
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not limited to glycosylation, ubiquitination, disulfide bond formation and general post- 
radiational modification. 

In an embodiment of the present invention, the ligand may be a derivative of, or a chemically 
5 modified ligand. The term "derivative" or "derivatised" as used herein includes the chemical 
modification of a ligand. 

A chemical modification of a ligand and/or a key amino acid residue of a ligand binding 
domain of the present invention may either enhance or reduce hydrogen bonding interaction, 
10 charge interaction, hydrophobic interaction, Van Der Waals interaction or dipole interaction 
between the ligand and the key amino acid residue(s) of the mannosidase II LBD. By way of 
example, steric hinderance is a common means of changing the interaction of the 
mannosidase II LBD binding domain with the activation domain. 

15 Preferably such modifications involve the addition of substituents onto a test compound such 
that the substituents are positioned to collide or to bind preferentially with one or more amino 
acid residues that correspond to the key amino acid residues of mannosidase II LBD of the 
present invention. Typical modifications may include, for example, the replacement of a 
hydrogen by a halo group, an alkyl group, an acyl group or an amino group. 

20 

The invention also relates to classes of modulators of mannosidase II based on the structure 
and shape of a substrate, defined in relation to the substrate's molecule's spatial association 
with a mannosidase II structure of the invention or part thereof. Therefore, a modulator may 
comprise a substrate molecule having the shape or structure, preferably the structural 
25 coordinates, of a substrate molecule in the active site binding pocket of a reaction catalyzed 
by a mannosidase II. In an embodiment, the substrate comprises GlcNAcMansGlcNAc2-Asn- 

A modulator may be an inhibitor of a mannosidase II such as swainsonine or a derivative or 
30 mimetic thereof. 
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A class of modulators of mannosidase II enzymes may comprise a compound containing a 
structure of swainsonine, and having one or more, preferably all, of the structural coordinates 
of swainsonine of Table 2 or 8. Functional groups in the swainsonine modulators may be 
5 substituted with, for example, alkyl, alkoxy, hydroxyl, aryl, cycloalkyl, alkenyl, alkynyl, thiol, 
thioalkyl, thioaryl, amino, or halo, or they may be modified using techniques known in the art. 
Substituents will be selected to optimize the activity of the modulator. 

PHARMACEUTICAL COMPOSITION 

10 

The present invention also provides the use of a ligand or modulator according to the 
invention, in the manufacture of a medicament to treat and/or prevent a disease in a 
mammalian patient. There is also provided a pharmaceutical composition comprising such a 
ligand or modulator and a method of treating and/or preventing a disease comprising the step 
15 of administering such a modulator or pharmaceutical composition to a mammalian patient. 

In an embodiment, the invention relates to a pharmaceutical composition which comprises a 
crystal structure of the invention or a part thereof (e.g. a binding domain), or a modulator of 
the invention in an amount effective to regulate one or more of the conditions described 
20 herein (e.g. tumor growth or metastasis) and a pharmaceutical^ acceptable carrier, diluent or 
excipient. 

The pharmaceutical compositions may be for human or animal usage in human and veterinary 
medicine and will typically comprise a pharmaceutically acceptable carrier, diluent, excipient, 
25 adjuvant or combination thereof. 

Acceptable carriers or diluents for therapeutic use are well known in the pharmaceutical art, 
and are described, for example, in Remington's Pharmaceutical Sciences, Mack Publishing 
Co. (A. R. Gennaro edit. 1985). The choice of pharmaceutical carrier, excipient or diluent 
30 can be selected with regard to the intended route of administration and standard 
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pharmaceutical practice. The pharmaceutical compositions may comprise as - or in addition 
to - the carrier, excipient or diluent any suitable binder(s), lubricant(s), suspending agent(s), 
coating agent(s), solubilising agent(s). 

5 A pharmaceutical composition of the invention can be administered to a subject in an 
appropriate carrier or diluent, co-administered with enzyme inhibitors or in an appropriate 
carrier such as microporous or solid beads or liposomes. Liposomes include water-in-oil-in- 
water emulsions as well as conventional liposomes (Strejan et al., (1984) J. Neuroimmunol 
7:27). 

10 

Preservatives, stabilizers, dyes and even flavouring agents may be provided in the 
pharmaceutical composition. Examples of preservatives include sodium benzoate, sorbic acid 
and esters of p-hydroxybenzoic acid. Antioxidants and suspending agents may also be used. 

15 The routes for administration (delivery) include, but are not limited to, one or more of: oral 
(e.g. as a tablet, capsule, or as an ingestable solution), topical, mucosal (e.g. as a nasal spray 
or aerosol for inhalation), nasal, parenteral (e.g. by an injectable form), gastrointestinal, 
intraspinal, intraperitoneal, intramuscular, intravenous, intrauterine, intraocular, intradermal, 
intracranial, intratracheal, intravaginal, intracerebroventricular, intracerebral, subcutaneous, 

20 ophthalmic (including intravitreal or intracameral), transdermal, rectal, buccal, vaginal, 
epidural, sublingual. 

Where the pharmaceutical composition is to be delivered mucosally through the 
gastrointestinal mucosa, it should be able to remain stable during transit through the 
25 gastrointestinal tract; for example, it should be resistant to proteolytic degradation, stable at 
acid pH and resistant to the detergent effects of bile. 

It is to be understood that not all of the agent need be administered by the same route. 
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Where appropriate, the pharmaceutical compositions can be administered by inhalation, in the 
form of a suppository or pessary, topically in the form of a lotion, gel, hydrogel, solution, 
cream, ointment or dusting powder, by use of a skin patch, orally in the form of tablets 
containing excipients such as starch or lactose or chalk, or in capsules or ovules either alone 
5 or in admixture with excipients, or in the form of elixirs, solutions or suspensions containing 
flavouring or colouring agents, or they can be injected parenterally, for example 
intravenously, intramuscularly or subcutaneously. For parenteral administration, the 
compositions may be best used in the form of a sterile aqueous solution which may contain 
other substances, for example enough salts or monosaccharides to make the solution isotonic 
10 with blood. The aqueous solutions should be suitably buffered (preferably to a pH of from 3 
p to 9), if necessary. The preparation of suitable parenteral formulations under sterile 

;*J conditions is readily accomplished by standard pharmaceutical techniques well-known to 

P those skilled in the art. 

••■^ 
s 

j j 15 If the agent of the present invention is administered parenterally, then examples of such 

s». administration include one or more of: intravenously, intra-arterially, intraperitoneal^, 

^ intrathecally, intraventricularly, intraurethrally, intrasternally, intracranially, intramuscularly 

j 1 ^ or subcutaneously administering the agent; and/or by using infusion techniques. 

s '~ 20 For buccal or sublingual administration the compositions may be administered in the form of 
tablets or lozenges which can be formulated in a conventional manner. 

The tablets may contain excipients such as microcrystalline cellulose, lactose, sodium citrate, 
calcium carbonate, dibasic calcium phosphate and glycine, disintegrants such as starch 
25 (preferably corn, potato or tapioca starch), sodium starch glycollate, croscarmellose sodium 
and certain complex silicates, and granulation binders such as polyvinylpyrrolidone, 
hydroxypropylmethylcellulose (HPMC), hydroxypropylcellulose (HPC), sucrose, gelatin and 
acacia. Additionally, lubricating agents such as magnesium stearate, stearic acid, glyceryl 
behenate and talc may be included. 

30 
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Solid compositions of a similar type may also be employed as fillers in gelatin capsules. 
Preferred excipients in this regard include lactose, starch, cellulose, milk sugar or high 
molecular weight polyethylene glycols. For aqueous suspensions and/or elixirs, the agent 
may be combined with various sweetening or flavouring agents, colouring matter or dyes, 
5 with emulsifying and/or suspending agents and with diluents such as water, ethanol, 
propylene glycol and glycerin, and combinations thereof. 



As indicated, a therapeutic agent of the present invention can be administered intranasally or 
by inhalation and is conveniently delivered in the form of a dry powder inhaler or an aerosol 

10 spray presentation from a pressurised container, pump, spray or nebuliser with the use of a 

fl suitable propellant, e.g. dichlorodifluoromethane, trichlorofluoromethane, 

^3 dichlorotetrafluoroethane, a hydrofluoroalkane such as 1,1,1,2-tetrafluoroethane (HFA 

'ffi 134A™) or 1,1,1,2,3,3,3-heptafluoropropane (HFA 227EA™), carbon dioxide or other 

^ suitable gas. In the case of a pressurised aerosol, the dosage unit may be determined by 

IV 15 providing a valve to deliver a metered amount. The pressurised container, pump, spray or 

s : :ra 

gJ nebuliser may contain a solution or suspension of the active compound, e.g. using a mixture 

; T:! of ethanol and the propellant as the solvent, which may additionally contain a lubricant, e.g. 

i'fj sorbitan trioleate. Capsules and cartridges (made, for example, from gelatin) for use in an 

fj inhaler or insufflator may be formulated to contain a powder mix of the agent and a suitable 

20 powder base such as lactose or starch. 



Therapeutic administration of polypeptide modulators may also be accomplished using gene 
therapy. A nucleic acid including a promoter operatively linked to a heterologous polypeptide 
may be used to produce high-level expression of the polypeptide in cells transfected with the 
25 nucleic acid. DNA or isolated nucleic acids may be introduced into cells of a subject by 
conventional nucleic acid delivery systems. Suitable delivery systems include liposomes, 
naked DNA, and receptor-mediated delivery systems, and viral vectors such as retroviruses, 
herpes viruses, and adenoviruses. 



30 



APPLICATIONS 
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The modulators and compositions of the invention may be used to modulate the biological 
activity of a mannosidase II in a cell, including modulating a pathway in a cell regulated by 
5 the mannosidase II or modulating a mannosidase II with inappropriate activity in a cellular 
organism. In addition, a mannosidase II structure of the invention may be used to devise 
protocols to modulate the biological activity of a mannosidase II in a cell. 

Cellular assays, as well as animal model assays in vivo, may be used to test the activity of a 
10 potential modulator of a mannosidase II as well as diagnose a disease associated with 
inappropriate mannosidase II activity. In vivo assays are also useful for testing the bioactivity 
of a potential modulator designed by the methods of the invention. 

The invention further provides a method of treating a mammal, the method comprising 
15 administering to a mammal a modulator or pharmaceutical composition of the present 
invention. 

Typically, a physician will determine the actual dosage which will be most suitable for an 
individual subject and it will vary with the age, weight and response of the particular patient 
20 and severity of the condition. The dosages below are exemplary of the average case. There 
can, of course, be individual instances where higher or lower dosage ranges are merited. 

The specific dose level and frequency of dosage for any particular patient may be varied and 
will depend upon a variety of factors including the activity of the specific compound 

25 employed, the metabolic stability and length of action of that compound, the age, body 
weight, general health, sex, diet, mode and time of administration, rate of excretion, drug 
combination, the severity of the particular condition, and the individual undergoing therapy. 
By way of example, the pharmaceutical composition of the present invention may be 
administered in accordance with a regimen of 1 to 10 times per day, such as once or twice per 

30 day. 
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For oral and parenteral administration to human patients, the daily dosage level of the agent 
may be in single or divided doses. 

5 The modulators (e.g. inhibitors) identified using the methods of the invention may be useful 
in the treatment and prophylaxis of tumor growth and metastasis of tumors. Anti-metastatic 
effects of inhibitors can be demonstrated using a lung colonization assay. For example, 
melanoma cells treated with an inhibitor may be injected into mice and the ability of the 
melanoma cells to colonize the lungs of the mice may be examined by counting tumor 
10 nodules on the lungs after death. Suppression of tumor growth in mice by the inhibitor 
p administered orally or intravenously may be examined by measuring tumor volume. 

if. An inhibitor identified using the invention may have particular application in the prevention 

i q of tumor recurrence after surgery i.e. as an adjuvant therapy. 

^ 15 

Hi An inhibitor may be especially useful in the treatment of various forms of neoplasia such as 

^ leukemias, lymphomas, melanomas, adenomas, sarcomas, and carcinomas of solid tissues in 

jj^" patients. In particular, inhibitors can be used for treating malignant melanoma, pancreatic 

p cancer, cervico-uterine cancer, ovarian cancer, cancer of the kidney such as metastatic renal 

5 ~ 20 cell carcinoma, stomach, lung, rectum, breast, bowel, gastric, liver, thyroid, head and neck 
cancers such as unresectable head and neck cancers, lymphangitis carcinamatosis, cancers of 
the cervix, breast, salivary gland, leg, tongue, lip, bile duct, pelvis, mediastinum, urethra, 
bronchogenic, bladder, esophagus and colon, non-small cell lung cancer, and Karposi's 
Sarcoma which is a form of cancer associated with HIV-infected patients with Acquired 
25 Immune Deficiency Syndrome (AIDS). The inhibitors may also be used for other anti- 
proliferative conditions such as bacterial and viral infections, in particular AIDS. 

An inhibitor identified in accordance with the present invention may be used to treat 
immunocompromised subjects. For example, they may be used in a subject infected with 
30 HIV, or other viruses or infectious agents including bacteria, fungi, and parasites, in a subject 
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undergoing bone marrow transplants, and in subjects with chemical or tumor-induced immune 
suppression. 

Inhibitors may be used as hemorestorative agents and in particular to stimulate bone marrow 
5 cell proliferation, in particular following chemotherapy or radiotherapy. The 
myeloproliferative activity of an inhibitor of the invention may be determined by injecting the 
inhibitor into mice, sacrificing the mice, removing bone marrow cells and measuring the 
ability of the inhibitor to stimulate bone marrow proliferation by directly counting bone 
marrow cells and by measuring clonogenic progenitor cells in methylcellulose assays. The 
10 inhibitors can also be used as chemoprotectants, and in particular to protect mucosal 
epithelium following chemotherapy. 

An inhibitor identified in accordance with the invention also may be used as an antiviral agent 
in particular on membrane enveloped viruses such as retroviruses, influenza viruses, 
15 cytomegaloviruses and herpes viruses. An inhibitor may also be used to treat bacterial, fungal, 
and parasitic infections. An inhibitor may also be used in the treatment of inflammatory 
diseases such as rheumatoid arthritis, asthma, inflammatory bowel disease, and 
atherosclerosis. 

20 An inhibitor may also be used to augment the anti-cancer effects of agents such as interleukin- 
2 and poly-IC, to augment natural killer and macrophage tumoricidal activity, induce cytokine 
synthesis and secretion, enhance expression of LAK and HLA class I specific antigens; 
activate protein kinase C, stimulate bone marrow cell proliferation including hematopoietic 
progenitor cell proliferation, and increase engraftment efficiency and colony-forming unit 

25 activity, to confer protection against chemotherapy and radiation therapy (e.g. 
chemoprotective and radioprotective agents), and to accelerate recovery of bone marrow 
cellularity particularly when used in combination with chemical agents commonly used in the 
treatment of human diseases including cancer and acquired immune deficiency syndrome 
(AIDS). For example, an inhibitor can be used as a chemoprotectant in combination with anti- 
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cancer agents including doxorubicin, 5-fluorouracil 7 cyclophosphamide, and methotrexate, 
and in combination with isoniazid or NS AID. 

Alpha-mannosidosis may also be amendable to treatment or prophylaxis by the 
method of the present invention. 

The loss of mannosidase II has been found to alter N-glycan branching and attenuate 
the immune system's ability to maintain self-tolerance (Chui et al„ PNAS 98(3): 1142- 1147, 
2001V Therefore, the structures, modulators, compositions, and methods of the invention may 
be useful in the treatment or prophylaxis of autoimmune disease including systemic lupus 
erythematosus. 

The present invention ihus provides a method for treating the above-mentioned 
conditions in a subject comprising administering to a subject an effective amount of a 
modulator of the invention. The invention also contemplates a method for stimulating or 
inhibiting tumor growth or metiistasis in a subject comprising administering to a subject an 
effective amount of a modulator of the invention. 

The following non-limiting examples are illustrative of the present invention. 
EXAMPLES 
Example 1 

Drosophila Mannosidase II preparation and structure determination 
Expression Plasmids 

Constructs designed to expressed dGMII in Drosophila Schneider (S2) cells were based on 
the DES expression system available from IriVitrogen with extensive modifications. 
Expression plasmids were conslructed which had the dGMII under the control either of the 
inducible metallothioneine (MX) promoter or the strong constitutive actin 5.1 promoter 
(AC5). Amino terminal purification tags were inserted in place of the C-tenninal tags in the 
commercially available vectors. Earlier attempts, to truncate the mouse enzyme from at the C- 
terminus resulted in inactive protein, as had also been noted with the GlcNAc-transferases. 
Thus, it was elected to keep the C-tenninus free. Expression vectors were created with either a 
6His-tag., for purification on metal chelate columns such as Ni-NTA (Qiageri) or cobalt based 
Talon columns (Ctontech), or with a Strep-tag for purification on streptavidin-Sepharose. 
These affinity tags are initially non-cleavable and add approximately 8-10 residues to the end 
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of the protein. Finally, constructs were made either lacking or containing the Bip secretion 
sequence to direct the expressed protein into the cells or medium respectively. 

Blasticidin Selection 

Initial attempts at stable transfection with the recommended hygromycin selection system 
were unsuccessful. Therefore a new selection plasmid, pCopBlast was created which encodes 
blasticidin S deaminase under the control of the constitutive copia promoter. Blasticidin S 
has been used for stable transfectants of mammalian and plant cells, as well as yeast. 
Commercially available control plasmids expressing MT-induced secreted green fluorescent 
protein (GFP), or constitutive and MT-induced unsecreted bacterial p-galactosidase (LacZ) 
were used to test the suitability of blasticidin selection in S2 cells, and to optimize conditions 
for transfection, selection, and metallothionein induction. Stable transfectants could be 
selected with 16 ^ig/ml blasticidin in Schneider's S2 medium containing 10% fetal bovine 
serum. Copper and cadmium were the only metals found to activate the MT promoter; copper 
favoured internally expressed proteins and cadmium, secreted proteins. Maintenance of the 
altered phenotype was also demonstrated for many weeks in the absence of the selective 
pressure of blasticidin demonstrating that these were indeed stably transfected cell lines. 

Creation of Stably Expressing dGMII cell lines. 

Starting with the pProtA expression plasmid from initial published studies [Rabouille et al, 
1999], the mannosidase coding region was excised, and inserted into an in-frame EcoRI site 
immediately at the end of the affinity tag in the new plasmids. The position of a unique 3 ' 
restriction site outside the coding region meant that 100-200 bp of extra sequence was added 
between the stop codon and the SV40 polyadenylation site. This extra sequence was 
removed with a short PCR amplification using a unique internal restriction site. Both ends of 
the constructs were sequenced to verify proper reading frame and lack of PCR errors. The 
resulting constructs consist of the dGMII catalytic region with a short length of the stalk 
region, in a variety of "flavours" of promoter, affinity tag, and expression location. 
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Co-transfection of the pCopBlast selection plasmid with the mannosidase expression 
plasmids, followed by selection for blasticidin resistance allowed stable expressing cell lines 
after approximately one month. Mannosidase activity was measured using PNP-mannoside, 
in a microtitre plate assay. Protein was detected on Western blots using anti-PentaHis 
5 antibody (Qiagen). Only the secreted products showed activity, with similar levels in the 
constitutive and MT-promoter constructs. No difference in mannosidase activity was seen 
between His or Strep tagged protein. All subsequent work was carried out with the secreted 
constructs. 

Insect cells do not grow at low population densities. Therefore, the initial population of 
selected cells was a mixed population with each cell in the culture having somewhat different 
levels of incorporated expression plasmid. To select individual cells with high levels of 
expression the stably transfected population was diluted to single cells in a 50:50 mix of 
conditioned medium and fresh medium with blasticidin. These were then plated in 96-well 
culture plates. After five weeks, about 10% of the wells showed growths of colonies large 
enough to transfer, of which roughly 30% had activity. The highest expressors had 
approximately 5 times the activity of the initial population in the MT-inducible strains. High- 
expressing clones of the constitutively expressed dGMII, were obtained suggesting that the 
continued production mannosidase by the cells may be detrimental, especially under the 
stressful conditions of single-cell selection. 

Expression and purification of dGMII. 

The availability of a stable clones expressing considerable amounts of mannosidase allowed 
optimization of induction, expression and purification conditions. In contrast to mammalian 
25 cells, insect cells are not highly adherent and will grow to high cell densities in a variety of 
culture vessels including roller bottles, spinners, fermentors and shake flasks. No CO2 is 
required, and temperatures in the range of 25-28°C are optimal. With stably transfected cells, 
the difficulties that accompany baculoviral infection do not arise. 




10 
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Initial experiments were carried out in S2 medium containing 10% bovine serum. Metal 
concentrations used to induce and time of induction were optimized for dGMII production. 
10-20 |xM cadmium proved optimal for induction. Although copper (at approximately 500- 
1000 ^M) is generally used in the literature for induction, the sensitivity of dGMII to 
inhibition by copper (IC50 = 25piM,[26]) precluded its use. Cadmium has been reported to be 
detrimental to the growth of cells. However, at the concentrations used here, the cells 
continued to grow and maintain greater than 90% viability (as assessed by Trypan blue 
exclusion) until the end of the induction period. Cells were maintained in the continous 
presence of cadmium for up to three passages. 



As the dGMII was secreted into the medium, it was badly contaminated with bovine serum 
albumin (BSA). Attempts to remove the impurity by Blue Agarose or Ni-NTA 
chromatography were unsuccessful. To circumvent this contamination problem a number of 
serum-free media were evaluated for growth and expression levels. There are very few 
15 serum-free media developed for Drosophila cells so ones that have been used with 
baculovirus expression systems were evaluated. Ultimately the Excel420 medium from JRH 
Biosciences was successful. 

A further advantage to this medium is the incorporation of seleno-methionine in place of 
20 methionine for cry stallo graphic phasing purposes. A custom preparation of this medium was 
purchased from JRH free of Met and Cu. Inclusion of 50 jig/ml of SeMet resulted in the 
production of protein with high enough incorporation (approximately 50% by mass 
spectrometry) for accurate phasing. 

25 Cells were adapted to serum-free growth by gradual dilution with CCM3 medium and then 
they were switched into the other media for the expression studies. Excel420, CCM3 and 
SFX-Insect were clearly superior for maintaining healthy growth, though CCM3 provided 
slightly lower levels of expression. Levels of cadmium required for induction were optimized 
for each medium and were considerably lower than those required in S2 medium. For 

30 unknown reasons, constitutive expression of dGMII was much lower in serum-free medium. 
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Therefore, all subsequent scale-up and purifications were carried out with the MT-inducible 
6His tagged constructs. 

To scale-up protein expression cells were first grown as suspensions in spinner cultures. 
5 These were subsequently put into 2.8 litre Fernbach flasks (1 litre Excel 420/flask) shaken at 
100 rpm at 28°C Cells were induced for 72 hours with 10 cadmium. After this time the 
medium was asceptically harvested and the cells are placed in the same volume of fresh 
medium for a further round of induction. This can be repeated at least one more time without 
significant cell death or loss of protein expression. Based on activity measurements up to 50 
10 mg/litre of medium can be expressed every three days. This is approximately 1000 fold 
P greater than in initial expression experiments in CHOP cells [Rabouille et al 9 1999]. This 

k n procedure requires about 2 weeks of dedicated time in an incubator/shaker. 

fU Purification is effected by batch binding first to Blue-Agarose, with elution by 350 mM NaCl, 

ffl 15 and then to Ni-NTA resin, with elution by 50 mM imidizole. Initial, secreted protein from the 
^ medium of the serum-free grown cells was loaded in batch to Blue-Agarose. The beads were 

'"•Q. then loaded into a column and washed with 20 column volumes of 50 mM NaCl in 20 mM 

| I Tris pH8. The majority of the mannosidase was eluted with 350 mM NaCl. This pooled eluant 

| ! f was loaded onto NiNTA, washed with low imidizole, and eluted with 50 mM imidizole to 

20 achieve crystallization purity. The protein is then dialysed extensively against 10 mM Tris, 
pH 8.3 and 100 mM NaCl and concentrated (to greater than 20 mg/ml) for crystallization 
trials. All crystallization has been carried out from a single protein preparation. 

Crystallization 

25 Crystals of Drosophila Mannosidase II and complexes of the enzyme with various inhibitors 
were grown at room temperature using vapor diffusion and micro-batch crystallization 
techniques. Crystals were obtained under a wide variety of conditions. Polyethylene glycol 
(PEG) was used as a precipitant (with sizes: 4000; 6000; 8000; 10000; and 20000) at 
concentrations varying from 5-20%, in the presence of 5% 2,4-methyl-pentanediol (MPD) or 

30 0-30% glycerol. Crystallization solutions were buffered at pH 7-7.5 using 100 mM buffer 
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solutions of Tris, Hepes or Mes. The crystals belong to the orthorhombic space group P2 1 2 1 2 j 
with cell dimensions: a=69A; b-1 10A; C-139A; <x=90°; (5=90°; 7=90°. For the initial structure 
determination Seleno-Methionine-derivatized Mannosidase II crystals were grown in 8.5% 
PEG 6000, 5% MPD and 100 mM Tris pH 7.0, using micro seeds obtained from wild-type 
5 enzyme crystals. Data were collected from crystals that were frozen in liquid nitrogen after a 
stepwise increase of the MPD concentration in the crystallization solution from 5% to 25%. 

A crystal of the invention is illustrated in the Figures. In particular, Figure 1 shows the active 
site of a mannosidase II. Figure 2 shows the secondary structure of Drosophila Golgi a- 

10 mannosidase II. Helices are in blue and p sheets are in red. Figure 3 shows the Drosophila 
golgi a-mannosidase II molecule with the colours representing where it is identical to human 
GMII. The red and blue represent deletions or insertions with respect to the human sequence. 
The green is a disulphide bond. Figure 4 shows the whole Drosophila golgi a-mannosidase II 
molecule in sticks with residues that are identical in the lysosomal manll as coloured balls 

15 (red or blue depending whether they are in the N-terminal or C-terminal part of the molecule). 
Figure 5 shows the active site of a Drospholiga mannosidase. Figure 6 shows the DNA 
sequence of an expressed Drosophila mannosidase. Figure 7 shows an alignment of expressed 
secreted Drosophila mannosidase with human mannosidase. 

20 Example 2 

Experimental Procedures 

Protein Overexpression and Purification 

Expression, purification and crystallization of the dGMII will be described in detail 
elsewhere. Briefly, the cDNA was inserted behind an inducible promoter, and used to stably 
25 transfect Drosophila S2 cells. Single cell clones secreting high levels of dGMII were chosen 
and adapted to serum-free medium. Unlabelled dGMII was isolated from the supernatants of 
cells grown in Fernbach flasks by batch binding to Blue-Agarose (Sigma). The protein was 
eluted from the Blue-Agarose using NaCl and further purified by Ni-NTA chromatography 
(Qiagen). EDTA (5 mM) was added to scavenge any free nickel. The protein was extensively 
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dialyzed against 10 mM Tris pH 8 containing 100 mM NaCl, concentrated to 25 mg/ml, and 
stored in aliquots at -80 °C. 

For seleno-methionine labeling, a custom batch of Ex-Cell 420 (#006140E JRH Biosciences, 
5 Lenexa KS) was used which lacked any added methionine or copper. Cells were grown to 
high cell density in a spinner flask in standard medium, resuspended in the "methionine-free" 
medium and allowed to starve for 4 hours prior to the addition of 50 mg/1 of seleno- 
methionine (Sigma). After 70 hrs of induction the protein was purified from the supernatant as 
outlined above except that 5 mM P-mercaptoethanol was present throughout the purification. 

10 

m Crystallization and Data Collection 

kQ Crystals of Drosophila Mannosidase II and complexes of the enzyme with various inhibitors 

m were grown at room temperature using vapor diffusion and micro-batch crystallization 

^ techniques. Crystals were obtained under a wide variety of conditions. Polyethylene glycol 

m 15 (PEG) was used as a precipitant (with sizes: 4000; 6000; 8000; 10000; and 20000) at 
|U concentrations varying from 5-20%, in the presence of 5% 2,4-methyl-pentane-diol (MPD) or 

U3 0-30% glycerol. Crystallization solutions were buffered at pH 7-7.5 using 100 mM buffer 

i ; I solutions of Tris, Hepes or Mes. The crystals belong to the orthorhombic space group P2i2i2j 

with cell dimensions: a=69A; b=l 10A; C-139A; ot=90°; p-90°; y=90\ For the initial structure 
20 determination Seleno-Methionine-derivatized Mannosidase II crystals were grown in 8.5% 
PEG 6000, 5% MPD and 100 mM Tris pH 7.0, using micro seeds obtained from wild-type 
enzyme crystals. Data were collected from crystals that were frozen in liquid nitrogen after a 
stepwise increase of the MPD or glycerol concentration in the crystallization solution from 
5% to 25%. Data collection was performed at the Advanced Photon Source facility at 
25 Argonne National Laboratories, Argonne, Illinois. Beam line BM14D was used for collection 
of multiple wavelength anomalous dispersion data and BM14C for collection of high- 
resolution data. 



Structure Determination 



10 
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The structure of uncomplexed dGMII was determined by MAD phasing at the Selenium 
absorption edge with datasets collected at an absorption peak wavelength of 0.9786 A, 
inflection wavelength of 0.9790A and a remote wavelength of 0.9770 A. Initial positions of 
26 out of 28 Selenium atoms were determined with the program Solve (Terwilliger et al., 
1987) with an initial Figure of Merit (FOM) of 0.67. The experimental map obtained after 
density modification, using the program DM of the CCP4 program package (Cowtan, 1994), 
showed continuous density of very high quality for the whole molecule. The structure was 
traced using the program O (Jones et al., 1991) using the density modified experimental map. 
The model was refined using the program CNS (Briinger et al., 1998). 



O Inductively Coupled Plasma Atomic Emission Spectroscopy (ICP-AES) 

uh; The metal content in dGMII samples was analyzed by inductively coupled plasma atomic 

^ emission spectroscopy using the ICP-AES model 'Optima 3000 DV (Dual View) from 

RJ Perkin Elmer. The zinc content in the protein samples was determined relative to an 

ifg 15 equivalent amount of dGMII assay buffer. 

RESULTS AND DISCUSSION 

^ Protein expression 

20 The cDNA for Drosophila GMII is predicted to encode a protein of 1 108 amino acids. For 
protein expression in Drosophila cells the first 75 amino acids consisting of the cytosolic and 
transmembrane domains and most of the stalk region were eliminated. The remaining cDNA 
was cloned in-frame behind a secretion signal. 

25 Numbering of our construct starts at the point where the expressed protein is expected to be 
cleaved, by signal peptidase, from the secretion signal. Three extra amino terminal residues, a 
6-histidine tag, and a glycine, glutamine and phenylalanine were added in cloning. The first 
aspartate (D13) of the construct corresponds to aspartate 76 of the native protein. The first 
residue seen in the structure (C31) corresponds to C94, and the final residue SI 044 to SI 107, 

30 of the full-length sequence. 
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Structure Determinations 

The structure of Drosophila Golgi ot-mannosidase II has been determined by the multi- 
wavelength anomalous dispersion (MAD) phasing method using a data set collected from a 
5 crystal of Seleno-methionine derivatized enzyme (Table 9). This is the first reported structure 
of a Se-Met substituted enzyme produced in a Drosophila overexpression system. The native 
dGMII structure has been refined to a resolution of 1.76A with some data to 1.4A resolution 
(see refinement statistics presented in Table 10). The model contains residues 31-1044 of the 
recombinant enzyme (numbered as described above), as well as a zinc ion, an N-glycan 

10 residue, a molecule of the cryo-protectant, 2-methyl-2,4-pentanediol (MPD), and a 
tris(hydroxymethyl)-aminomethane (Tris) molecule. The presence of the enzyme-bound zinc 
ion was confirmed by Inductively Coupled Plasma Atomic Emission Spectroscopy (ICP- 
AES). The final structure of the dGMII-swainsonine complex has been refined at 1.87A 
resolution and the dGMII-DMNJ complex to 1.69A resolution, with some data to 1.5 A 

15 resolution. 

Overall Architecture of dGMII 

The structure of dGMII reveals a previously unobserved protein fold consisting of an N- 
terminal a/p domain, a three-helical bundle and an all-P C-terminal domain forming a single 

20 compact entity, connected by 5 internal disulfide bonds and stabilized by a zinc binding site 
(Figure 8B). The oval shaped molecule has two distinct faces (Figure 8C). The N-terminal 
face of the molecule is convex, whereas the opposing face of the enzyme has a planar surface. 
N-terminal residue Cys-3 1 is the last residue of the so-called stalk region, the linkage between 
the catalytic domain and the transmembrane domain. Cys-3 1 is located at the convex face of 

25 the molecule, indicating that this surface of the molecule presumably faces the inner side of 
the Golgi membrane, while the planar surface, containing the active site cavity (see below), 
faces the Golgi lumen. 

The N-terminal a/p domain is comprised of an inner core of three P-sheets (A, B and C, 
30 Figure 8B) consisting of 11, mostly parallel P-strands, surrounded by 16 a-helices. This 
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domain contains a GlcNAc residue found in the electron density map at a consensus N- 
glycosylation site (Asn-194), located at the N-terminus of helix 7. The a/p domain is 
stabilized by three disulfide bonds: between Cys-31 and Cys-1032 connecting the N and C- 
terminal extremes of dGMII; Cys-275 and Cys-282 linking helices 10 and 11; Cys-283 and 
5 Cys-297 linking helix 1 1 with a loop between helix 13 and the core of parallel (3-sheets. The 
cysteines forming the latter two disulfide-bonds are conserved in the human Golgi ct- 
mannosidase II sequence. 

The C-terminal half of the protein contains a three-helix bundle, comprised of helices 18, 20 
10 and 21 , and is connected to the N-terminal a/(i-domain via a zinc binding site. The zinc ion is 
[a coordinated in a Ts-square-based pyramidal geometry involving residues: Asp-90, His-92, 

Asp-204 and His-471. Furthermore, the C-terminal domain contains two immunoglobulin-like 
13 domains: a small p-sandwich consisting of 12 anti-parallel strands from P-sheets D and E, and 

ifi a large 21 -strand structure involving p-sheets F and G. 

m i 5 

O A barrel formed by the three-helix bundle and helix-23 together with the two p-sandwich 

III structures result in a narrow pore in the center of the C-terminal domain. The pore is lined by 

six arginine residues: Arg-540, 565, 617, 770, 777 and 893, contributing to the overall 
M : positive charge of the pore (Figure 9A). A hairpin loop, connecting two strands of P-sheet D 

20 (Figure 8B and C, residues 527-540, shown in yellow) protrudes into the center of the barrel 
on the planar side of the molecule. Arginine residue 530, located at the tip of the type-I p-turn 
in this loop, plugs the pore preventing an open channel through the protein. The resulting 
crater-like cavity on the convex side of the molecule is 20A deep, with a diameter of 20A 
funneling to 8A at the bottom of the cavity. B-factor values of residues within the loop 
25 indicate a higher degree of flexibility compared to the rest of the structure (average B-factor 
values: -33A 2 and -15 A 2 , respectively). 
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Active Site 

The molecular surface representation of the planar face of dGMII reveals an extended pocket 
in the N-terminal ot/p-domain, formed primarily by acidic residues (Figure 9B). These same 
5 residues form the core of a large, contiguous, surface-exposed patch, of highly conserved 
amino acids, in comparison with the human GMII sequence (Figure 9C). The active site of the 
enzyme is located in a small cavity in the side of this conserved, negatively charged region. 
The cavity is lined by aromatic residues Trp-95, Phe-206, Tyr-269 and Tyr-727, which are 
involved in hydrophobic and hydrogen-bond interactions with a bound Tris molecule in the 
10 unliganded structure (Figure 10A). Tris is known to inhibit dGMII activity (Rabouille et al., 
1999). Additional hydrophobic and hydrogen bond interactions are observed with Asp-92 and 
Asp-204. At the open side of the cavity the Tris molecule hydrogen bonds with Arg-228, Tyr- 
269 and Asp-341 (not shown) via water molecules. 

15 A key feature of the active site is the coordination of the zinc ion by the Tris hydroxyl group 
02. In the enzyme-Tris complex the zinc ion is bound in a Ts-square-based pyramidal 
geometry, coordinated by the OD1 oxygen moieties of aspartate residues 92 and 204; the NE2 
nitrogens of histidines 90 and 471; and the hydroxyl oxygen 02 of the bound Tris molecule, 
as represented in Figure 10A. The T 5 geometry is further stabilized by hydrogen bonds 

20 between the zinc coordinating atoms and the existence of H-bonds between the ND1 nitrogen 
atoms of the histidines 90 and 471 with the carbonyl oxygen of seleno-methionine 167 and a 
water molecule, respectively (not shown). The presence of these, so called, 'elec-His-Zn 
motifs 1 is believed to increase the basicity and the ligand strength of the histidine and arrange 
it correctly for interaction with the metal (Alberts et al., 1998). In an uninhibited enzyme, Tris 

25 would likely be replaced by a coordinating water molecule. As discussed below, this 
arrangement has implications for substrate binding and transition state stabilization. 

The occurrence of zinc in Family 38 glycosyl hydrolases has been described by Snaith (1975) 
in Jack-bean ot-mannosidase. A possible role for zinc in catalysis was indicated by 
30 inactivation of the enzyme by chelating agents and bivalent metal ions such as Cu 2+ . Copper 
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has also been shown to effectively inactivate Drosophila and mouse GMII (Rabouille et al., 
1999). 

Inhibitor Binding 

5 The structures of dGMII in complex with the inhibitors DMNJ and swainsonine show that 
both compounds bind to the same active site in a similar manner (Figure 10B and C). The 
binding of both inhibitors involves a large contribution of hydrophobic interactions involving 
aromatic residues Trp-95, Phe-206 and Tyr-727, forming the walls of the cavity. The inhibitor 
ring structures are stacked against Trp-95, a feature seen in several carbohydrate binding and 

10 hydrolyzing proteins (see Boraston et al., 2000 and review papers therein), and stabilized by 
hydrogen bonds and interactions with the zinc ion. In the complexes of dGMII with either 
DMNJ or swainsonine the T 5 geometry of the bound zinc ion, as seen in the Tris-bound 
enzyme, is transformed into T6-octahedral coordination. In both the dGMII complexes the 
inhibitor 02 hydroxyl oxygen replaces the 02 oxygen of Tris and the 03 hydroxyl oxygen 

15 forms the apex of the second pyramid. In order to obey the restraints of the T 6 geometry, the 
plane of the swainsonine ring structure is tilted with respect to the saccharide-like ring of the 
bound DMNJ molecule. This enables the formation of a hydrogen bond between the zinc- 
coordinating OD1 oxygen of Asp-204 and the N4 nitrogen at the fusion of the five and six- 
membered rings of swainsonine. As in the Tris-bound enzyme, the zinc coordinating oxygen 

20 atoms of the inhibitors are involved in hydrogen bond interactions with the neighboring metal 
binding residues of the enzyme. 

The position of the DMNJ and swainsonine molecules is stabilized in the active site by 
hydrogen bonds between carboxylic oxygens OD1 and OD2 of residue Asp-472 and hydroxyl 
25 oxygens 03 and 04 (05 in swainsonine) of the inhibitors, analogous to the 01 and 02 
interactions seen in the enzyme-Tris complex. As in the Tris-bound enzyme, DMNJ is 
involved in additional hydrogen bonds, via water molecules, with the NH2 nitrogen of Arg- 
228 ? the hydroxyl oxygen of Tyr-269, the backbone carbonyl oxygen of Arg-876 (not shown) 
and the OD1 oxygen of Asp-204. 

30 
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The displacement of the Tris molecule by either of the inhibitors only slightly affects the zinc 
binding site by weakening the internal hydrogen bonds between Asp-204 and histidines 90 
and 471. No major conformational changes are observed between the Tris-bound and the 
inhibitor-bound mannosidase molecules as their backbones are virtually superimposable, with 
5 root-mean-square-deviations between Ca atoms of 0.068A (dGMII-DMNJ complex) and 
0.087A (dGMII-swainsonine complex). 

Catalytic mechanism 

Golgi ot-mannosidase II is a retaining mannosyl hydrolase, which cleaves the linkage between 
10 the CI atom of M7 and M6 (Figure 8A) and, respectively, the 03 and 06 atom of the al,6- 
linked mannosyl branch (M4) of GlcNAcMansGlcNAc2. The catalytic mechanism is proposed 
to follow a very similar path to the corresponding retaining f$-glycosidases (Braun et al., 
1995; White and Rose, 1997). This is a two-stage reaction that usually involves two 
carboxylic acids, one acting as a nucleophile attacking the glycosidic bond, and the other as a 
15 general acid/base catalyst. Nucleophilic attack of one carboxylic acid results in glycosylation 
of the enzyme by forming a covalent intermediate followed by a second deglycosylation step, 
each step passing through an oxocarbonium ion-like transition state. 

Based on the structure of the dGMII-inhibitor complexes we speculate that the mannose 
20 residues on the al,6-linked mannosyl branch (M4) bind to the enzyme at the same site and in 
the same manner as mannose-like inhibitor DMNJ. Coordination of the zinc ion with the 02 
and 03 hydroxyl oxygens thereby contributes to the enzyme's specificity for mannose. Four 
acidic amino acid residues, Asp-92, Asp-204, Asp-341 and Asp-472, are candidates for 
catalytic side chains based on their proximity to the active site (Figure IOC). Results from a 
25 recent study on the mechanism of catalysis in Jack-bean ct-mannosidase by Withers and co- 
workers, using reagents that trap the glycosyl-enzyme intermediate, identified an aspartate 
residue as the catalytic nucleophile in that enzyme (Howard et al., 1998). Comparison of the 
highly conserved sequence region surrounding this aspartate in Jack-bean a-mannosidase 
with the same sequence region in dGMII suggests that aspartate residue 204 in dGMII is the 
30 catalytic nucleophile that attacks the glycosidic linkage. For this reaction it is required that 
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Asp-204 is close to the anomeric carbon of the mannose substrate. In the dGMII-DMNJ 
complex, however, the equivalent anomeric carbon is located 4.6A from the nucleophile. 
Binding of the C2 and C3 substituent hydroxyl oxygens of the flattened five-membered ring 
in swainsonine causes the inhibitor molecule to tilt, bringing its bridgehead nitrogen N4, in 
5 the analogous position to CI in the substrate, significantly closer to the putative nucleophilic 
Asp-204 (3. 2 A). This tilted binding mode, stabilized by a hydrogen bond between N4 and 
Asp-204 and by van der Waals stacking interactions between the 6-membered ring of 
swainsonine and Phe-206, may resemble the mode of binding of the ring-flattened transition 
state mannosyl cation. Thus, Phe-206 would stabilize the transition state by compensating for 
10 the loss of stacking interactions of the substrate with Trp-95. The highly complementary 
3 shape of swainsonine with the active site of dGMII, and its structural analogy with the skewed 

H boat transition state conformation, could therefore explain its 10,000 times higher binding 

affinity for the enzyme, compared to the substrate-mimic DMNJ (data not shown). 

'if 

^ 15 The OD1 oxygen of Asp-204, the putative nucleophile, directly coordinates the zinc ion, 
implicating a role for the zinc in positioning the nucleophile and in the stabilization of 
protonation states of the reacting partners. It is tempting to speculate that the change of zinc 
coordination from T5 to the less favored T6 state (Alberts et al., 1998) on substrate binding 
may also contribute to the mechanism. From the Tris and DMNJ structures, it is predicted that 
20 the coordination would revert to T5 on product release. If so, this transition may energetically 
facilitate the deglycosylation step. Such evidence of direct zinc involvement in the catalytic 
mechanism of a glycosyl hydrolase is unprecedented. Arg-288 positions Asp-204 for 
nucleophilic attack by virtue of hydrogen bond interactions between its NE and NH2 
nitrogens and the OD2 oxygen of Asp-204 (Figure 10C). Based on the expected distance 
25 between the two catalytic residues (-5.5A, Davies and Henrissat, 1995) likely candidates for 
the catalytic base are Asp-341 and Asp-472 (preliminary indications are that the D341N 
mutant is catalytically inactive, DAK unpublished results). Recent data suggest that other 
residues, such as tyrosines, possibly play a role in glycosidic bond cleavage (Davies and 
Henrissat, 1995). Tyrosine residues 269 are 727 are positioned to help stabilize the transition 
30 state. 
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Substrate Binding and Cleavage 

The function of GMII is dependent on the presence of pl,2-GlcNAc (G3, Figure 8A), added 
to al,3-linked mannose (M5) by GlcNAc transferase I (see reviews: Kornfeld and Kornfeld, 
5 1985; Moremen et al., 1994). This pi,2-GlcNAc dependence suggests the presence of an 
additional saccharide-binding site in GMII. Evidence for such a binding site is provided by 
the observation of an MPD molecule in the structure of dGMII, in the vicinity of the active 
site cavity. MPD was used as a cryo-protectant during the procedure of flash-freezing of the 
crystal, prior to data collection (see experimental procedures). The replacement of MPD by 
10 the alternative cryo-protectant glycerol resulted in the occupation of this same position by a 

\i glycerol molecule. Glycerol has been shown to mimic saccharide binding in structures of 

=j3 glycosyl hydrolases (Schmidt et al., 1 998, Vallee et al., 2000). 

jj^ The observation of the binding of MPD and glycerol near dGMII's active site (Figure 11 A) 

ffl 15 enables a hypothesis regarding the binding and cleavage of al,6 and al,3-linked mannoses 

Si 

^ on the <xl,6-linked mannose branch of the GlcNAcMan5GlcNAc 2 oligosaccharide. In this 

jSt hypothesis, the MPD binding site is suggested to be the putative site of interaction for (31,2- 

H ; GlcNAc (G3, Figure 8 A), enabling anchoring of the oligosaccharide substrate in the 

Si's? 

1^- conserved negatively charged pocket. In Figure 1 IB a model is shown of a 

20 GlcNAcMansGlcNAc2 structure with the pi,2-GlcNAc residue placed in the MPD binding 
site and the al,6-linked M6 mannose docked into the active site, with its hydroxy 1 oxygens 
02 and 03 coordinating the zinc ion. As required, the asparagine linked (31,4-GlcNAc 
residues Gl and G2 extend away from the surface of the molecule (into the Golgi lumen). 
Both M4 and the second substrate al,3-linked M7 mannose are located within the conserved 
25 negatively charged pocket pointing away from the active site cavity. In this orientation it can 
be easily visualized that after cleavage of the al,6-Iinked M6 the second, al,3-linked M7 can 
be brought into the active site cavity by a -180° rotation, through the extended pocket, around 
the flexible ctl,6-Iinkage of M4 (see Figure 1 1C). In addition to the dependence of GMIFs 
action on the presence of the G3 pi,2-GlcNAc, this model provides a mechanism for the 
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cleavage of both mannose residues without major conformational change of the enzyme, and 
more importantly, without release of the polypeptide-carbohydrate complex, anchored by the 
stationary GlcNAc, between the two cleavage events. Finally, this model suggests that the 
ccl,6-linked M6 mannose is preferentially cleaved first, enabling the shorter al,3-linked M7 
residue to rotate through the pocket with minimal steric hindrance; according to our model, 
the proposed 'swivel' mechanism would be slightly hampered should the M7 mannose be 
cleaved first. This is supported by data reported for a-mannosidase II from mung bean 
seedlings, Xenopus liver, Rat liver Golgi and for enzyme-activity in homogenates of insect 
cells, showing preferential hydrolytic activity on the M6 mannosyl residue (Kaushal et al., 
1990; Altmann and Martz, 1995; Ren et al., 1997). 
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Conclusions 

The structure of the catalytic domain of Golgi a-mannosidase II provides the basis for its zinc 
ion mediated specificity for mannose, as well as insight into its reaction mechanism. In 
addition, the result illustrates the structural basis for the mechanism of inhibition by the anti- 
5 cancer agent swainsonine, which we propose mimics aspects of the transition state binding. 
This understanding is critical for the rational design of swainsonine variants and/or novel 
mechanism-based compounds as specific a-mannosidase II inhibitors, for the treatment of 
several forms of cancer. A bound MPD molecule identifies a putative GlcNAc binding 
pocket, located near the active site and enables a hypothesis explaining the enzyme's 
10 dependency on the single GlcNAc substitution of the GlcNAcMan 5 GlcNAc 2 substrate for 
binding. Furthermore, it suggests a novel mechanism for successive hydrolysis of the a 1,6 
and al,3-Iinked mannose residues, resulting in the tri-mannose core glycosyl structure. 
Finally, it opens the door to the design of novel highly specific inhibitors linking together 
functional sites in the enzyme. 

15 

Various modifications and variations of the described methods and system of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the 
invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly 
20 limited to such specific embodiments. Indeed, various modifications of the described modes 
for carrying out the invention which are obvious to those skilled in chemistry or biology or 
related fields are intended to be covered by the present invention. All publications mentioned 
in the above specification are herein incorporated by reference. 
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Table 1 

Structural coordinates of a Drosophila Golgi a-mannosidase II. 

REMARK coordinates from simulated annealing refinement 

REMARK refinement resolution: 500.0 - 1.4 A 

REMARK starting r= 0.1816 free_r= 0.2003 

REMARK final r= 0.18 94 free_r= 0.2063 

REMARK rmsd bonds- 0.004594 rmsd angles= 1.32379 

REMARK wa_initial= 0.264577 wa_dynamics= 0.28954 wa_final= 0.28836 
REMARK target= mlf md-method= torsion annealing schedule- slowcool 
REMARK starting temperature^ 1000 total md steps= 40 * 6 

REMARK sg= P2 ( 1 ) 2 ( 1 ) 2 ( 1 ) a- 68.865 b= 109.718 c= 138.599 alpha= 90 beta- 90 
gamma- 90 

REMARK parameter file 1 : CNS_TOPPAR:protein_rep. param 

REMARK parameter file 2 : CNSJTOPPAR: water_rep. param 

REMARK parameter file 3 : CNSJTOPPAR: ion. param 

REMARK parameter file 4 : trs.par 

REMARK parameter file 5 : mpd.par 

REMARK parameter file 6 : cisjpeptide .param 

REMARK parameter file 7 : CNSJTOPPAR: carbohydrate . param 

REMARK molecular structure file: dgmllcgen . mtf 

REMARK input coordinates: dgmllcgen .pdb 

REMARK reflection file= . . /semetHiR. cv 

REMARK ncs= none 

REMARK B-correction resolution: 6.0 - 1.4 
REMARK initial B-factor correction applied to fobs : 
REMARK Bll= 0.609 B22= -0.765 B33= 0.155 
REMARK B12= 0.000 B13= 0.000 B23= 0.000 

REMARK B-factor correction applied to coordinate array B: 0.042 
REMARK bulk solvent: density level- 0.35999 e/A A 3, B-factor= 42.8385 A A 2 
REMARK reflections with | Fobs | /sigma_F < 0.0 rejected 
REMARK reflections with | Fobs I > 10000 * rms(Fobs) rejected 
REMARK theoretical total number of refl. in resol . range: 206243 (100.0%) 
REMARK number of unobserved reflections (no entry or |F|=0): 59797 (29.0%) 
REMARK number of reflections rejected: 0 (0.0%) 
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Table 2 

Structural coordinates of a Drosophila Golgi ct-mannosidase II with swainsonine. 

REMARK coordinates from restrained individual B-factor refinement 

5 REMARK refinement resolution: 500.0 - 1.87 A 

REMARK starting r= 0.1835 free_r= 0.2089 

REMARK final r= 0.1801 free__r= 0.2084 

REMARK B rmsd for bonded mainchain atoms= 0.707 target= 1.5 

REMARK B rmsd for bonded sidechain atoms= 1.139 target^ 2.0 

10 REMARK B rmsd for angle mainchain atoms= 1.167 target= 2.0 

REMARK B rmsd for angle sidechain atoms= 1.765 target^ 2.5 

REMARK wa= 1.14241 

REMARK rweight=0. 2694 4 5 

REMARK targe t= mlf steps= 30 

15 REMARK sg= P2 ( 1) 2 ( 1) 2 (1 ) a- 68.902 b= 110.015 c= 138.472 alpha= 90 beta= 90 

gamma = 90 



CNS_TOPPAR : prot ein_rep . param 
CNS_TOPPAR : wat er_rep . param 
CNS_TOPPAR: ion. param 
swainsonine2 .par 
. . /zntrmp/mpd.par 
cis_peptide . param 
CNS TOPPAR: carbohydrate .param 



REMARK parameter file 1 

REMARK parameter file 2 

REMARK parameter file 3 
20 REMARK parameter file 4 

REMARK parameter file 5 

REMARK parameter file 6 

REMARK parameter file 7 

REMARK molecular structure file: swainsoninegen.mtf 
25 REMARK input coordinates: swainsonine_ann_l . pdb 

REMARK reflection file= dgm2native_re jmerge . cv 

REMARK ncs= none 

REMARK B-correction resolution: 6.0 - 1.87 
REMARK initial B-factor correction applied to fobs : 
30 REMARK Bll= 0.513 B22- -0.085 B33= -0.428 
REMARK B12= 0.000 B13= 0.000 B23= 0.000 

REMARK B-factor correction applied to coordinate array B: -0.306 
REMARK bulk solvent: density level= 0.353213 e/A"3, B-factor- 42.0423 A"2 
REMARK reflections with | Fobs | /sigma_F < 0.0 rejected 
35 REMARK reflections with | Fobs [ > 10000 * rms(Fobs) rejected 
REMARK theoretical total number of refl. in resol. range: 
% ) 

REMARK number of unobserved reflections (no entry or |F1=0) 
REMARK number of reflections rejected: 
40 REMARK total number of reflections used: 

REMARK number of reflections in working set: 
REMARK number of reflections in test set: 
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OH2 


WAT 


w 


961 


23. 


884 


82. 


876 


-3. 


966 


1 


.00 


52. 


89 


w 




ATOM 


9144 


OH2 


WAT 


w 


962 


44 . 


483 


40. 


961 


7. 


504 


1 


.00 


44 . 


52 


w 




ATOM 


9145 


OH2 


WAT 


w 


963 


35. 


403 


62. 


493 


33. 


962 


1 


.00 


31. 


32 


w 


10 


ATOM 


9146 


OH2 


WAT 


w 


964 


27. 


262 


90. 


077 


-31. 


713 


1 


.00 


39. 


79 


w 




ATOM 


9147 


OH2 


WAT 


w 


965 


74 . 


769 


77 . 


373 


-20. 


732 


1 


.00 


39. 


11 


w 




ATOM 


9148 


OH2 


WAT 


w 


966 


49. 


100 


42. 


041 


8. 


300 


1 


.00 


33. 


85 


w 




ATOM 


9149 


OH2 


WAT 


w 


967 


44. 


902 


92. 


910 


-25. 


044 


1 


.00 


36. 


65 


w 




ATOM 


9150 


OH2 


WAT 


w 


968 


52. 


476 


49. 


145 


25. 


620 


1 


.00 


35. 


34 


w 


15 


ATOM 


9151 


OH 2 


WAT 


w 


969 


9. 


649 


53. 


663 


-12. 


194 


1 


.00 


30. 


57 


w 




ATOM 


9152 


OH 2 


WAT 


w 


970 


58. 


733 


51. 


116 


14. 


538 


1 


.00 


47. 


36 


w 




ATOM 


9153 


OH 2 


WAT 


w 


971 


51. 


884 


50. 


452 


27. 


694 


1 


.00 


38. 


68 


w 




ATOM 


9154 


OH2 


WAT 


w 


972 


25. 


020 


89. 


138 


-25. 


139 


1 


.00 


38. 


86 


w 




ATOM 


9155 


OH2 


WAT 


w 


973 


7. 


521 


58. 


222 


0. 


201 


1 


.00 


37. 


34 


w 


20 


ATOM 


9156 


OH2 


WAT 


w 


974 


11. 


524 


58. 


491 


19. 


987 


1 


.00 


37. 


19 


w 




ATOM 


9157 


OH2 


WAT 


w 


975 


18. 


598 


60. 


943 


-37. 


781 


1 


.00 


25. 


28 


w 




ATOM 


9158 


OH2 


WAT 


w 


976 


46. 


,002 


60. 


113 


-31. 


315 


1 


.00 


37. 


62 


w 




ATOM 


9159 


OH 2 


WAT 


w 


977 


5. 


,368 


57. 


731 


6. 


783 


1 


.00 


33. 


43 


w 




ATOM 


9160 


OH2 


WAT 


w 


978 


13. 


,342 


66. 


496 


-11. 


623 


1 


.00 


59. 


42 


w 


25 


ATOM 


9161 


OH2 


WAT 


w 


979 


47. 


,205 


81. 


657 


-17. 


119 


1 


.00 


33. 


31 


w 




ATOM 


9162 


OH2 


WAT 


w 


981 


29. 


,413 


65. 


677 


12. 


426 


1 


.00 


21. 


92 


w 




ATOM 


9163 


OH2 


WAT 


w 


982 


28. 


,559 


69. 


915 


12. 


822 


1 


.00 


31. 


51 


w 




ATOM 


9164 


OH2 


WAT 


w 


983 


33. 


,312 


68. 


983 


16. 


916 


1 


.00 


38. 


02 


w 




ATOM 


9165 


CI 


NAG 


c 


1 


58. 


, 321 


45. 


027 


12. 


880 


1 


.00 


46. 


25 


c 


30 


ATOM 


9166 


C2 


NAG 


c 


1 


59. 


,553 


44. 


726 


13. 


744 


1 


.00 


48. 


,64 


c 




ATOM 


9167 


N2 


NAG 


c 


1 


60. 


, 611 


45. 


671 


13. 


441 


1 


.00 


50. 


02 


c 




ATOM 


9168 


C7 


NAG 


c 


1 


60. 


,818 


46. 


715 


14. 


237 


1 


.00 


51. 


,53 


c 




ATOM 


9169 


07 


NAG 


c 


1 


60. 


,267 


47. 


803 


14. 


068 


1 


.00 


53. 


,37 


c 




ATOM 


9170 


C8 


NAG 


c 


1 


61. 


.782 


46. 


528 


15. 


397 


1 


.00 


51. 


,81 


c 


35 


ATOM 


9171 


C3 


NAG 


c 


1 


60. 


,046 


43. 


300 


13. 


495 


1 


.00 


49. 


,65 


c 




ATOM 


9172 


03 


NAG 


c 


1 


61, 


. 101 


42. 


997 


14 . 


395 


1 


.00 


50. 


,85 


c 




ATOM 


9173 


C4 


NAG 


c 


1 


58. 


. 905 


42. 


304 


13. 


689 


1 


.00 


49. 


,25 


c 




ATOM 


9174 


04 


NAG 


c 


1 


59. 


. 345 


40. 


999 


13. 


,344 


1 


.00 


50. 


,57 


c 




ATOM 


9175 


C5 


NAG 


c 


1 


57. 


.716 


42. 


702 


12. 


810 


1 


.00 


49. 


,13 


c 


40 


ATOM 


9176 


05 


NAG 


c 


1 


57. 


.295 


44. 


050 


13. 


123 


1 


.00 


47. 


,36 


c 




ATOM 


9177 


C6 


NAG 


c 


1 


56. 


.518 


41. 


,797 


13. 


,024 


1 


.00 


48. 


,96 


c 




ATOM 


9178 


06 


NAG 


c 


1 


55. 


.511 


42. 


,038 


12. 


,052 


1 


.00 


50. 


,02 


c 




ATOM 


9179 


CI 


SWA 




1 


31. 


.064 


66. 


,873 


6. 


,079 


1 


.00 


9. 


.68 






ATOM 


9180 


01 


SWA 




1 


31. 


.597 


68. 


,032 


5. 


,416 


1 


.00 


11. 


.09 




45 


ATOM 


9181 


C3 


SWA 




1 


31, 


.295 


67. 


,013 


7. 


, 615 


1 


.00 


10. 


.48 






ATOM 


9182 


N4 


SWA 




1 


30, 


.738 


65. 


,836 


8. 


.320 


1 


.00 


10. 


.85 






ATOM 


9183 


C5 


SWA 




1 


29, 


.276 


65. 


.603 


8. 


. 133 


1 


.00 


10. 


. 14 






ATOM 


9184 


C6 


SWA 




1 


28, 


.973 


65. 


,471 


6. 


, 610 


1 


.00 


9. 


. 14 






ATOM 


9185 


C2 


SWA 




1 


29, 


.531 


66. 


,700 


5. 


,827 


1 


.00 


9. 


.59 




50 


ATOM 


9186 


C9 


SWA 




1 


31, 


.268 


65. 


,873 


9. 


, 682 


1 


.00 


11. 


.70 






ATOM 


9187 


C8 


SWA 




1 


32, 


. 681 


66. 


,486 


9. 


.558 


1 


.00 


11, 


.30 






ATOM 


9188 


013 


SWA 




1 


33 


. 681 


65. 


.511 


9. 


, 836 


1 


.00 


11. 


.34 






ATOM 


9189 


C7 


SWA 




1 


32 


.764 


67. 


.083 


8. 


.112 


1 


.00 


10. 


.44 






ATOM 


9190 


Oil 


SWA 




1 


33, 


.674 


66. 


.300 


7. 


.314 


1 


.00 


10, 


.67 
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ATOM 


9191 


CI 


MPD 


M 


1 


14 , 


.797 


61, 


.266 


10, 


. 322 


1 


.00 


23, 


.09 


M 



415 



ATOM 


.7 X J Z. 






M 


J. 


1 £ 


0 C^A 


ox . 




x U . 


a i a 
. b 1 4 


i 

i . 


uu 


O "5 
Z 3 . 


. bU 


ATOM 


9193 


02 


MPD 


M 


1 


16, 


.876 


60. 


330 


9. 


.851 


1. 


00 


24 , 


.23 


ATOM 


9194 


CM 


MPD 


M 


1 


17. 


,075 


62. 


611 


10. 


,000 


1. 


00 


24 . 


.36 


ATOM 


9195 


C3 


MPD 


M 


1 


16. 


.492 


61. 


271 


12. 


,125 


1. 


00 


23. 


.19 


ATOM 


9196 


C4 


MPD 


M 


1 


17. 


.813 


60, 


945 


12. 


. 649 


1. 


00 


23. 


,12 


ATOM 


9197 


04 


MPD 


M 


1 


17. 


.580 


59. 


936 


13. 


627 


1. 


00 


22. 


,00 


ATOM 


9198 


C5 


MPD 


M 


1 


18. 


.337 


62. 


222 


13. 


,387 


1. 


00 


22. 


, 61 


ATOM 


9199 


2N 


ZN 


Z 


1 


34. 


,561 


64. 


335 


8. 


062 


1. 


00 


15. 


.34 



END 

10 
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Table 3 

Structural Results: Active Site 



5 From initial analysis of the crystal structure, based on the location of residues known to be 
involved in catalysis, the active site of the enzyme is located within a region composed of the 
following residues: 





GLY 


60 


VAL 


61 


TRP 


62 


LYS 


63 


GLN 


64 


GLY 


65 




ILE 


68 


VAL 


83 


PHE 


84 


VAL 


85 


VAL 


86 


PRO 


87 




HIS 


88 


SER 


89 


HIS 


90 


ASN 


91 


ASP 


92 


TRP 


95 




ILE 


110 


met 


124 


PHE 


126 


ILE 


127 


TRP 


128 


ala 


129 




GLU 


130 


VAL 


161 


MET 


167 


TRP 


201 


ALA 


202 


ILE 


203 


Q: 


ASP 


204 


PRO 


205 


PHE 


206 


GLY 


207 


HIS 


208 


LEU 


225 


ILE 


226 


GLN 


227 


ARG 


228 


thr 


229 


TYR 


231 


LYS 


234 




LEU 


243 


HIS 


262 


MET 


263 


MET 


264 


PRO 


265 


PHE 


266 




TYR 


267 


SER 


268 


TYR 


269 


ASP 


270 


ILE 


271 


PRO 


272 




HIS 


273 


THR 


274 


CYS 


275 


GLY 


276 


PRO 


277 


ASP 


278 


ly 


PRO 


279 


LYS 


280 


val 


281 


CYS 


282 


CYS 


283 


GLN 


284 




PHE 


285 


ASP 


286 


PHE 


287 


LYS 


288 


ARG 


289 


met 


290 




phe 


293 


giy 


294 


leu 


295 


ser 


296 


CYS 


297 


PRO 


298 




TRP 


299 


lys 


300 


VAL 


301 


PRO 


302 


PRO 


303 


leu 


317 


P 


LEU 


318 


GLN 


321 


trp 


322 


LYS 


324 


LYS 


325 


ala 


326 


a ; J; 


LEU 


328 


tyr 


329 


LEU 


334 


LEU 


335 


ile 


336 


PRO 


337 




LEU 


338 


GLY 


339 


ASP 


340 


ASP 


341 


PHE 


342 


ARG 


343 




phe 


344 


lys 


345 


GLU 


349 


val 


352 


GLN 


353 


arg 


354 




TYR 


357 


LEU 


360 


PHE 


361 


PHE 


376 


GLY 


377 


LEU 


379 




TYR 


407 


ALA 


408 


ASP 


409 


ARG 


410 


trp 


415 


HIS 


471 




ASP 


472 


TYR 


727 


ASP 


874 


glu 


875 


ARG 


876 


GLY 


877 



LEU 878 



Residues in lower case are not identical in the human Golgi Manll sequence, though many of 
those are conservative substitutions. In this sphere of 15A around the catalytic center, there 
are only 21 non-identities among 121 residues, indicating that the human active site will be 
essentially identical to that observed for the Drosophila structure. 

The numbering system in this and the accompanying Figures corresponds to the protein 
expressed in the system described herein, not to full length Manll. 
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Table 4 

Intermolecular Interactions at an Active Site of a Mannosidase II Swainsonine Complex. 



No. of Atomic 


Swainsonine 


Enzyme 


Distance 


Atomic 


Interaction 


Atomic 


Atomic 


Between 


Interaction 




Contact 


Contact 


Atomic 


Property 








Contacts 




1 


01 


His 471 NE2 


3.3 


HB 


2 


N 


Asp 204 0D1 


2.9 


ITD 


3 


02 


Asp 34 1 0D2 


3.4 


LTD 


4 


02 


His 90 NE2 


3.1 


14 PI 


5 




A<;n 9? DDI 


2.9 


HB 


6 


08 


Asp 472 OD1 


2.53 


HB 


7 


Ring 


Phe 206 ring 


3.55 


VW 


8 


08 


Tyr 727 OH 


2.6 


HB 


9 


ring 


Trp95 


3.7 


VW 



5 



HB: hydrogen bond interaction 
VW Van der Waals 
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Table 5 

Crystallographic Refinement Statistics for the Native Drosophila Gogli 
Mannosidase II 



>>> input coordinates: dgml2_ann_lbi . pdb 
>>> molecular structure file: dgml2gen .mtf 



>>> parameter file 
»> parameter file 
>>> parameter file 
>>> parameter file 
>>> parameter file 
>>> parameter file 
>>> parameter file 
»> reflection file= 



CNS_TOPPAR : protein_rep . param 
CNS_TOPPAR : wat er_rep . param 
CNS JTOPPAR : ion . param 
trs .par 
mpdnew2 . par 
cis_peptide . param 
CNS_TOPPAR : carbohydrate . param 
/semetHiR. cv 
»> spacegroup: P2 (1) 2 (1) 2 (1) 

»> cell dimensions: a= 68.865 b= 109.718 c= 138.599 alpha= 90 beta= 90 
gamma = 90 

»> current wa= 0.311868 for target= mlf 
>>> ncs= none 

»> initial B-factor correction applied to fobs : 
»> Bll= 0.587 B22= -0.754 B33= 0.167 
»> B12= 0.000 B13= 0.000 B23= 0.000 

»> B-factor correction applied to coordinate array B: 0.088 
»> B-correction resolution: 6.0 - 1.4 

»> bulk solvent: density level= 0.359928 e/A A 3, B-factor= 43.085 A"2 



summary 



resolution range: 500.0 - 1.4 A 
R-values : 
initial 

after B-factor and/or bulk solvent correction 



r= 0.2348 f ree_r= 0.2361 
r= 0.1931 free r= 0.2105 



Monitor for target "mlf" is R-value : 
working set= 0.1931 test set= 0.2105 

luzzati coordinate error (5.0 - 1.4 A ) 

cross-validated luzzati coordinate error (5.0 - 1.4 A ) 

sigmaa coordinate error (5.0 - 1.4 A ) 

cross-validated sigmaa coordinate error (5.0 - 1.4 A ) 

rmsd bonds= 0.004763 with 2 bond violations > 0.05 
rmsd angles^ 1.32400 with 32 angle violations > 8.0 

rmsd dihedrals= 24.22585 with 4 angle violations > 60.0 

rmsd improper= 0.80790 with 38 angle violations > 3.0 



0.16 A 

0.18 A 

0.08 A 

0.10 A 



B-f actors 



average B-factor= 15.7963 
minimum B-factor= 4.5681 
maximum B-factor= 57.2481 

B rmsd for bonded mainchain atoms= 0.660 
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B rmsd for bonded sidechain atoms= 1.229 

B rmsd for angle mainchain atoms= 1.131 

B rmsd for angle sidechain atoms= 1.777 
current rweight= 0. 22677665 



reflections with | Fobs | /sigma_F < 0.0 rejected 
10 reflections with | Fobs | > 10000 * rms(Fobs) rejected 

theoretical total number of refl. in resol. range: 206243 

number of unobserved reflections (no entry or |F|=0): 59797 

number of reflections rejected: 0 

total number of reflections used: 146446 

15 number of reflections in working set: 139067 

number of reflections in test set: 7379 



100.0 % 

29.0 % 

0.0 % 

71.0 % 

67.4 % 

3.6 % 



20 



=> completeness 



Test set (test = 1) 



25 



30 



35 



40 



45 



#bin i 


resolution range 


1 #refl 1 






1 


3.02 


500.01 


1113 


0. 


0521 


2 


2.39 


3.02 


1042 


0. 


0501 


3 


2.09 


2.39 


1048 


0. 


0508 


4 


1.90 


2.09 


1000 


0. 


0485 


5 


1.76 


1. 90 


989 


0. 


0482 


6 


1.66 


1.76 


822 


0. 


0401 


7 


1.58 


1. 66 


584 


0. 


0285 


8 


1.51 


1.58 


419 


0. 


0205 


9 


1.45 


1.51 


265 


0. 


0130 


10 


1.40 


1.45 


97 


0 


0048 


Working set : 










#bin | 


resolution range 


1 #refl 1 






1 


3.02 


500.01 


20201 


0 


9460 


2 


2.39 


3.02 


19732 


0 


9491 


3 


2.09 


2.39 


19506 


0 


9447 


4 


1.90 


2.09 


19199 


0 


9316 


5 


1.76 


1.90 


18521 


0 


9019 


6 


1.66 


1.76 


15253 


0 


7434 


7 


1.58 


1.66 


11380 


0 


5551 


8 


1.51 


1.58 


8211 


0 


4018 


9 


1.45 


1.51 


5065 


0 


2477 


10 


1.40 


1.45 


1999 


0 


0980 



50 



R-values 



55 



R- values with | Fobs I /sigma cutoff = 0.0 
Test set (test = 1) : 
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#bin I resolution range I irefl I 



1 


3.02 


500.01 


1113 


0. 


,1999 


2 


2.39 


3.02 


1042 


0. 


,2110 


3 


2.09 


2.39 


1048 


0. 


,2083 


4 


1. 90 


2.09 


1000 


0. 


,2137 


5 


1.76 


1.90 


989 


0. 


.2244 


6 


1.66 


1.76 


822 


0. 


,2302 


7 


1.58 


1.66 


584 


0. 


,2275 


8 


1.51 


1.58 


419 


0, 


,2363 


9 


1.45 


1.51 


265 


0, 


.2186 


10 


1.40 


1.45 


97 


0, 


.2706 



Working set: 
15 #bin | resolution range | #refl I 



20 



25 



1 


3.02 


500. 


01 


20201 


0. 


1858 


2 


2.39 


3. 


02 


19732 


0. 


,1989 


3 


2.09 


2. 


39 


19506 


0. 


1935 


4 


1.90 


2. 


09 


19199 


0. 


1922 


5 


1.76 


1. 


90 


18521 


0. 


.1934 


6 


1.66 


1. 


76 


15253 


0. 


, 1973 


7 


1.58 


1. 


66 


11380 


0. 


,1989 


8 


1.51 


1. 


58 


8211 


0. 


,2067 


9 


1.45 


1. 


51 


5065 


0. 


,2248 


10 


1.40 


1. 


45 


1999 


0. 


,2589 



sigmaa-values 



30 sigmaa calculated using cross-validated data (test set) 





number 


of bins 


for sigmaa 


calculation^ 


50 




#bin | 


I resolution range 


1 #refl 1 








1 


5.16 


500.01 


4426 


0. 


, 9319 


35 


2 


4.09 


5.16 


4270 


0. 


, 9576 




3 


3.58 


4 .09 


4227 


0. 


, 9615 




4 


3.25 


3.58 


4196 


0. 


, 9632 




5 


3.02 


3.25 


4195 


0. 


,9422 




6 


2.84 


3.02 


4177 


0. 


,9512 


40 


7 


2.70 


2.84 


4163 


0. 


,9369 




8 


2.58 


2.70 


4152 


0. 


,9424 




9 


2.48 


2.58 


4156 


0. 


, 9486 




10 


2.39 


2.48 


4126 


0. 


, 9448 




11 


2.32 


2.39 


4131 


0. 


, 9407 


45 


12 


2.25 


2.32 


4149 


0. 


, 9498 




13 


2.19 


2.25 


4101 


0. 


. 9417 




14 


2.14 


2.19 


4113 


0. 


, 9351 




15 


2.09 


2.14 


4060 


0. 


.9459 




16 


2.05 


2.09 


4096 


0. 


.9333 


50 


17 


2.01 


2.05 


4078 


0. 


.9552 




18 


1. 97 


2.01 


4005 


0. 


. 9445 




19 


1.93 


1.97 


4001 


0. 


. 9443 




20 


1.90 


1.93 


4019 


0. 


.9352 




21 


1.87 


1.90 


3971 


0. 


. 9411 


55 


22 


1.84 


1.87 


3941 


0. 


. 9319 
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23 


1.81 


1 . 


. 84 


3946 


0. 


9360 


24 


1.79 


1 , 


. 81 


3860 


0. 


9480 


25 


1.76 


1 . 


.79 


3792 


0. 


9213 


26 


1 .74 


1 . 


.76 


3632 


0. 


9184 


27 


1 .72 


1 , 


.74 


3458 


0. 


9382 


28 


1 . 70 


1 . 


. 72 


3191 


0. 


9414 


29 


1 . 68 


1 , 


.70 


2994 


0. 


9275 


30 


1 . 66 


1 , 


, 68 


2800 


0. 


9268 


31 


1 . 64 


1 , 


. 66 


2678 


0. 


9303 


32 


1 . 62 


1 , 


. 64 


2506 


0. 


9388 


33 


1 . 61 


1 . 


. 62 


2366 


0. 


9506 


34 


1.59 


1 , 


. 61 


2249 


0. 


9402 


35 


1 . 58 


1 , 


. 59 


2165 


0. 


9270 


36 


1.56 


1 . 


. 58 


2006 


0. 


9365 


37 


1 . 55 


1 ( 


. 56 


1874 


0. 


9184 


38 


1 . 53 


1 , 


. 55 


1700 


0 . 


9072 


39 




1 ( 


. 53 


1595 


0 . 


9358 


40 




1 1 


. 52 


1455 


0 . 


9398 


4 1 


1 . 50 


1 . 


. 51 


1336 


0. 


9433 


42 


1.48 


1 ( 


. 50 


1147 


0 . 


9541 




i 47 


1 1 


. 4 8 


1077 


0 . 


9419 


44 


1.46 


1. 


.47 


962 


0. 


9331 


45 


1.45 


1. 


.46 


808 


0. 


9242 


46 


1.44 


1. 


.45 


696 


0. 


9038 


47 


1.43 


1. 


.44 


570 


0. 


9371 


48 


1.42 


1, 


.43 


408 


0. 


9387 


49 


1.41 


1. 


. 42 


288 


0. 


8914 


50 


1.40 


1, 


.41 


134 


0. 


8194 



============================ non-trans peptides 



cis-peptide: segid=A resid=406 resname=THR 

current dihedral value= -0.010 

cis-peptide: segid=A resid=532 resname=PRO 

current dihedral valuer -0.435 

=============================== occupancies 



no atoms have zero occupancy 
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Table 6 

Crystal lographic Refinement Statistics 
II Associated with Swainsonine 



for the Drosophila Golgi Mannosidase 



»> input coordinates: swainsonine3_ann_lbi .pdb 
>>> molecular structure file: swainsoninegen3 .mtf 



>» parameter file 
>>> parameter file 
»> parameter file 
>>> parameter file 
»> parameter file 
>» parameter file 
»> parameter file 



1 : CNS_TOPPAR:protein_rep. param 

2 : CNSJTOPPAR:water_rep. param 

3 : CNSJTOPPAR: ion.param 

4 : swa.par 

5 : . . /zntrmp/mpdnew2 .par 

6 : cis_peptide .param 

7 : CNS_TOPPAR: carbohydrate .param 
»> reflection file= dgm2native_re jmerge . cv 

>» spacegroup: P2 (1)2(1)2(1) 

»> cell dimensions: a= 68.902 b= 110.015 c= 138.472 alphas 90 beta= 90 
gamma= 90 

»> current wa= 0.632464 for target= mlf 
>>> ncs= none 

»> initial B-factor correction applied to fobs : 
»> Bll= 0.551 B22= -0.116 B33= -0.435 
»> B12= 0.000 B13= 0.000 B23- 0.000 

>>> B-factor correction applied to coordinate array B: 0.413 
»> B-correction resolution: 6.0 - 1.87 

»> bulk solvent: density level- 0.354131 e/A A 3, B-factor= 42.2797 A"2 



summary 



resolution range: 500.0 - 1.87 A 
R-values : 
initial 

after B-factor and/or bulk solvent correction 



r= 0.2078 free_r= 0.2371 
r= 0.1810 free r= 0.2090 



Monitor for target "mlf" is R-value : 
working set= 0.1810 test set= 0.2090 



luzzati coordinate error (5.0 - 1.87 A ) 

cross-validated luzzati coordinate error (5.0 * 1.87 A ) 

sigmaa coordinate error (5.0 - 1,87 A ) 

cross-validated sigmaa coordinate error (5.0 - 1.87 A ) 



0.19 A 

0.22 A 

0.12 A 

0.14 A 



rmsd bonds= 0.005230 with 2 bond violations > 0.05 
rmsd angles= 1.31525 with 34 angle violations > 8.0 
rmsd dihedrals^ 24.18605 with 4 angle violations > 60.0 
rmsd improper= 0.78741 with 34 angle violations > 3.0 



average B-factor= 19.441 
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minimum B-factor= 8.18268 

maximum B-factor= 64.5527 

B rmsd for bonded mainchain atoms= 0.699 

B rmsd for bonded sidechain atoms= 1.141 

B rmsd for angle mainchain atoms= 1.167 

B rmsd for angle sidechain atoms= 1.747 

current rweight= 0.18691688 



10 



diffraction data 



15 



20 



reflections with I Fobs I /sigma_F < 0.0 rejected 
reflections with | Fobs | > 10000 * rms(Fobs) rejected 
theoretical total number of refl. in resol. range: 
number of unobserved reflections (no entry or |F|=0): 
number of reflections rejected: 
total number of reflections used: 
number of reflections in working set: 
number of reflections in test set: 



87643 ( 100.0 % 
2814 ( 3.2 % 



=> completeness 



0 ( 

84829 { 

80543 ( 

4286 ( 



0.0 % 

96.8 % 

91.9 % 
4.9 % 





Test 


set 


(test 


= 1) : 








25 


#bin 


I resolution range 


#refl 1 








1 


4 


03 500.01 


452 


0. 


0493 




2 


3. 


20 


4.03 


488 


0. 


0550 




3 


2 


79 


3.20 


398 


0. 


0453 




4 


2 


54 


2.79 


433 


0. 


0496 


30 


5 


2. 


36 


2.54 


460 


0. 


0527 




6 


2 


22 


2.36 


433 


0. 


0497 




7 


2. 


11 


2.22 


417 


0. 


0480 




8 


2. 


01 


2.11 


402 


0. 


0464 




9 


1. 


94 


2.01 


406 


0. 


0470 


35 


10 


1. 


87 


1.94 


397 


0. 


0457 



40 



45 



50 



Working set: 

#bin I resolution range 



1 


4. 


03 


500.01 


2 


3. 


20 


4 .03 


3 


2. 


79 


3.20 


4 


2. 


54 


2.79 


5 


2. 


36 


2.54 


6 


2. 


22 


2.36 


7 


2. 


11 


2.22 


8 


2. 


01 


2.11 


9 


1. 


94 


2.01 


10 


1. 


87 


1. 94 



#refl 

8521 

8330 

8244 

8134 

8056 

8022 

7985 

7940 

7835 

7476 



0.9298 
0.9385 
0. 9391 
0.9315 
0.9234 
0.9215 
0.9196 
0.9158 
0.9077 
0.8615 



55 R-values with I Fobs I /sigma cutoff = 0.0 
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Test set (test 



1) : 



10 



15 



20 



25 



#bin | 


resolution range 


#refl 1 






1 


4.03 


500.01 


452 


0. 


2028 


2 


3.20 


4.03 


488 


0. 


1836 


3 


2.79 


3.20 


398 


0. 


2059 


4 


2.54 


2.79 


433 


0. 


2246 


5 


2.36 


2.54 


*t \j \j 


o 


2203 


6 


2.22 


2.36 




0 . 


2125 


7 


2.11 


2.22 




n 


2304 


8 


2.01 


2.11 


4 (19 

*i U Z, 


n 




9 


1.94 


2.01 


4 Ofi 


n 


2154 


10 


1.87 


1. 94 




n 




Working set: 










#bin I 


resolution range 


wreti i 






1 


4.03 


500.01 


8521 


0. 


1673 


2 


3.20 


4.03 


8330 


0. 


1704 


3 


2.79 


3.20 


8244 


0. 


1861 


4 


2.54 


2.79 


8134 


0. 


1857 


5 


2.36 


2.54 


8056 


0. 


1830 


6 


2.22 


2.36 


8022 


0. 


1827 


7 


2.11 


2.22 


7985 


0. 


1894 


8 


2.01 


2.11 


7940 


0. 


1826 


9 


1.94 


2.01 


7835 


0. 


1920 


10 


1.87 


1.94 


7476 


0. 


2320 



30 



sigmaa- values 



35 



40 



45 



50 



55 



sigmaa calculated using cross-validated data (test set) 
number of bins for sigmaa calculation= 50 



#bin 


resolution range 


#refl 1 






1 


6.89 


500.01 


1778 


0. 


9682 


2 


5.47 


6.89 


1834 


0. 


9324 


3 


4 .78 


5.47 


1793 


0. 


9330 


4 


4.34 


4.78 


1797 


0. 


9551 


5 


4 .03 


4.34 


1771 


0. 


9634 


6 


3.79 


4 .03 


1778 


0. 


9700 


7 


3.60 


3.79 


1751 


0. 


9563 


8 


3.44 


3.60 


1749 


0. 


9651 


9 


3.31 


3.44 


1775 


0. 


9515 


10 


3.20 


3.31 


1765 


0. 


9589 


11 


3.10 


3.20 


1714 


0. 


9433 


12 


3.01 


3.10 


1746 


0. 


9571 


13 


2.93 


3.01 


1737 


0. 


9545 


14 


2.86 


2.93 


1721 


0. 


9457 


15 


2.79 


2.86 


1724 


0. 


9391 


16 


2.73 


2.79 


1727 


0. 


9327 


17 


2.68 


2.73 


1708 


0. 


9429 


18 


2.63 


2. 68 


1726 


0. 


9267 


19 


2.58 


2.63 


1737 


0. 


9416 
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20 


2. 54 


2 . 58 


1669 


0. 


9385 


21 


2. 50 


2.54 


1705 


0. 


9257 


22 


2.46 


2.50 


1713 


0. 


9444 


23 


2.42 


2.46 


1721 


0. 


9384 


24 


2.39 


2.42 


1665 


0. 


9456 


25 


2. 36 


2.39 


1712 


0. 


9398 


26 


2. 33 


2.36 


1699 


0. 


9427 


27 


2. 30 


2.33 


1675 


0. 


9341 


28 


2.27 


2.30 


1723 


0. 


9324 


29 


2 .24 


2.27 


1653 


0. 


9570 


30 


2. 22 


2 .24 


1705 


0. 


9355 


31 


2. 19 


2 . 22 


1711 


0. 


9093 


32 


2 . 17 


2 . 19 


1650 


0. 


9188 


33 


2 . 15 


2. 17 


1659 


0. 


9341 


34 


2. 13 


2 . 15 


1707 


0. 


9520 


35 


2. 11 


2 . 13 


1675 


0. 


9298 


36 


2.09 


2 . 11 


1655 


0. 


9446 


37 


2 . 07 


2 . 09 


1691 


0. 


9407 


38 


2 . 05 


2 . 07 


1685 


0. 


9234 


39 


2 . 03 


2 . 05 


1631 


0. 


9412 


40 


2 . 01 


2 . 03 


1680 


0. 


9613 


41 


2 . 00 


2 . 01 


1646 


0. 


9423 


42 


1 . 98 


2 . 00 


1669 


0. 


9395 


43 


1 . 97 


1 . 98 


1626 


0. 


9240 


44 


1.95 


1.97 


1693 


0. 


9373 


45 


1. 94 


1.95 


1607 


0. 


9534 


46 


1.92 


1. 94 


1634 


0. 


9255 


47 


1.91 


1.92 


1652 


0. 


9181 


48 


1.90 


1.91 


1616 


0. 


9194 


49 


1.88 


1. 90 


1597 


0. 


9026 


50 


1.87 


1.88 


1374 


0. 


8987 



non-trans peptides 



cis-peptide : segid=A resid=406 resname=THR 

current dihedral value= 0.298 

cis-peptide: segid=A resid=532 resname=PRO 

current dihedral value= -0.154 

=============================== occupancies 



no atoms have zero occupancy 



Table 7 



- Glycoside Hydrolase Classification 





CAZy Family Glycoside Hydrolase Family 38 

^ Known Activities cc-mannosidase (EC 3.2. 1 .24) (EC 3.2.1.11 4). 

Mechanism Retaining 

Catalytic 
Nucleophile/Base K 

Catalytic Proton Donor Not known 

3D Structure Status Not available 

Relevant Links InterPro; PFAM 

Statistics CAZy(48); GenBank/GenPept (108); Swissprot (31) 





Protein 


Organism 


EC# 


GenBank / GenPept SwissProt 


PDB/ 
3D 




1-m 



St^anrto^^ 

a^anno^dase \;.Cwj&al&hnsfc : ;3.2.^24i : ^ 

ct^arinb^dase (a-Ma^ 




ORF BcDNA.GH02419 



Drosophila melanogaster 



n.d. 



AF145601 AAD38576.1 
^003628 A/^52958, 1 

;ORF£§^ 

ORFCG9463 Drosophila melanogaster n.d. AE003622 AAF52708.1 

ORFCG9465 Drosophila" melanogaster n.d: AE003622 AAF52709.1 

ORF CG9466 Drosophila melanogaster n.d. AE003622 AAF52710.1 

ORFCG9468 :Dro^ophilamalahog^er n.d. AE003622 AAF52711.1 



ORF YbgB 



Escherichia cpli K12 



n.d. 



D90713 BAA35398.1 



O& Ytefc ^rWmm?*!^ P54746 
• : - ; . ; P75753 

a-mannosidase (lysosomal) Fe//s cafus 



3.2.1.24 AF010191 AAB97672.1 046432 
AF010192AAB97733.1 

Q16766 

U37248 AAO00568.1 : Q13358 




£5^3 £^ 0 81 i/f ; G93093 ' 




a^ai)ri$S^ 

a-mahhc^dase 6A8B : t0Hpfjio sapiens 



.__ ... 

3r?KIm693^ : ' n ^. O AB023152 BAA7677S,1 




a-mannosidase 2 ^Musm 

ORFRv0648 Mycobacterium tuberculosis 

H37Rv 

ORFPH0835 ; IPffi 

a-mannosidase 1 Ratfus norvegicus 



3.2.1.114 X61172 CAA43480.1 P27046 

:i? : :oj;ri:^|P 

r ^ 292772 CAB07105. 1 P96937 

n.d. AP000003 BAA29929. 1 058565 

3.2.1.24 M57547AAA4 1565.1 P21139 



a-mannosidas Saccharomyces cerevisiae . 3:2.1.24 M27809 AM34423.1 P22855 

M29146AAA34423.1 
Z48618CAA88536.1 
272678 CAA96868.1 

ocrmannosidase II Spoddptera frugiperda ' -ndr- \/ : "ftfe0O56a4/Ai?te62719.1 , 018497 

a-manho^dase /-.. _ Sussdrofa ■ -V^^:/"?:^ Q28949 

ORFTM1231 "; • ; : ;7^^<^6srimarTfe?a ;AE06l779)^^ ;Q9X0V8 

cc^mannpsidase t ; ' .- ^Mm^-^^l'^'''- •'■■•'^■'■■^ 043249 



Table 8 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATOM 


1 


N 


CYS 


A 


31 


44 . 


192 


36. 


201 


-18. 


860 


1. 


00 


0. 


00 


N 


ATOM 


2 


CA 


CYS 


A 


31 


43. 


432 


37. 


453 


-18. 


569 


1. 


00 


0. 


00 


C 


ATOM 


3 


C 


CYS 


A 


31 


41. 


925 


37. 


255 


-18. 


672 


1. 


00 


0. 


00 


c 


ATOM 


4 


O 


CYS 


A 


31 


41. 


437 


36. 


646 


-19. 


622 


1. 


00 


0. 


00 


0 


ATOM 


5 


CB 


CYS 


A 


31 


43. 


821 


38. 


563 


-19. 


546 


1. 


00 


0. 


00 


c 


ATOM 


6 


SG 


CYS 


A 


31 


45. 


504 


39. 


239 


-19. 


,408 


1. 


00 


0. 


,00 


S 


ATOM 


7 


N 


GLN 


A 


32 


41. 


194 


37. 


780 


-17. 


693 


1. 


00 


0. 


00 


N 


ATOM 


8 


CA 


GLN 


A 


32 


39. 


739 


37. 


692 


-17. 


702 


1. 


00 


0. 


,00 


c 


ATOM 


9 


C 


GLN 


A 


32 


39. 


199 


38. 


,594 


-18. 


,800 


1. 


00 


0. 


,00 


c 


ATOM 


10 


O 


GLN 


A 


32 


39. 


785 


39. 


630 


-19. 


113 


1. 


00 


0. 


00 


0 


ATOM 


11 


CB 


GLN 


A 


32 


39. 


146 


38. 


164 


-16. 


373 


1. 


00 


0. 


00 


c 


ATOM 


12 


CG 


GLN 


A 


32 


39. 


160 


37. 


145 


-15. 


,258 


1. 


00 


0. 


,00 


c 


ATOM 


13 


CD 


GLN 


A 


32 


38. 


246 


37. 


,549 


-14. 


,118 


1. 


00 


0. 


,00 


c 


ATOM 


14 


OE1 


GLN 


A 


32 


37. 


027 


37. 


630 


-14. 


285 


1. 


00 


0. 


,00 


0 


ATOM 


15 


NE2 


GLN 


A 


32 


38. 


829 


37. 


814 


-12. 


.955 


1. 


00 


0. 


,00 


N 


ATOM 


16 


N 


ASP 


A 


33 


38. 


079 


38. 


,196 


-19. 


,384 


1. 


00 


0. 


,00 


N 


ATOM 


17 


CA 


ASP 


A 


33 


37. 


449 


38. 


989 


-20. 


427 


1. 


00 


0. 


,00 


c 


ATOM 


18 


C 


ASP 


A 


33 


36. 


549 


39. 


962 


-19. 


,668 


1. 


00 


0. 


,00 


c 


ATOM 


19 


O 


ASP 


A 


33 


35. 


610 


39. 


545 


-18. 


998 


1. 


00 


0. 


,00 


0 


ATOM 


20 


CB 


ASP 


A 


33 


36. 


,638 


38. 


.070 


-21. 


,344 


1. 


00 


0. 


,00 


c 


ATOM 


21 


CG 


ASP 


A 


33 


36. 


,037 


38. 


,801 


-22. 


,524 


1. 


00 


0. 


.00 


c 


ATOM 


22 


OD1 


ASP 


A 


33 


35. 


.788 


38, 


,143 


-23. 


,557 


1. 


00 


0. 


.00 


0 


ATOM 


23 


OD2 


ASP 


A 


33 


35. 


802 


40. 


024 


-22. 


,416 


1. 


00 


0. 


.00 


0 


ATOM 


24 


N 


VAL 


A 


34 


36. 


,844 


41. 


,255 


-19. 


,757 


1. 


00 


0. 


,00 


N 


ATOM 


25 


CA 


VAL 


A 


34 


36, 


,065 


42. 


.254 


-19. 


,031 


1. 


00 


0. 


,00 


c 


ATOM 


26 


C 


VAL 


A 


34 


34. 


.893 


42. 


,824 


-19, 


,820 


1. 


00 


0. 


.00 


c 


ATOM 


27 


O 


VAL 


A 


34 


34. 


,211 


43, 


,735 


-19. 


.354 


1. 


00 


0. 


.00 


0 


ATOM 


28 


CB 


VAL 


A 


34 


36. 


,969 


43. 


.425 


-18. 


.568 


1. 


00 


0, 


.00 


c 


ATOM 


29 


CGI 


VAL 


A 


34 


38. 


, 134 


42. 


.887 


-17. 


.755 


1. 


,00 


0, 


.00 


c 


ATOM 


30 


CG2 


VAL 


A 


34 


37. 


.491 


44. 


,203 


-19. 


, 774 


1. 


00 


0. 


.00 


c 


ATOM 


31 


N 


VAL 


A 


35 


34. 


.639 


42, 


,272 


-21. 


.001 


1. 


,00 


0. 


.00 


N 


ATOM 


32 


CA 


VAL 


A 


35 


33. 


,557 


42. 


.772 


-21. 


.836 


1. 


00 


0. 


.00 


c 


ATOM 


33 


C 


VAL 


A 


35 


32. 


,381 


41. 


.825 


-22. 


.061 


1. 


,00 


0. 


.00 


c 


ATOM 


34 


O 


VAL 


A 


35 


31. 


,224 


42. 


.211 


-21. 


.898 


1. 


.00 


0, 


.00 


0 


ATOM 


35 


CB 


VAL 


A 


35 


34. 


,092 


43. 


.170 


-23, 


.236 


1, 


.00 


0, 


.00 


c 


ATOM 


36 


CGI 


VAL 


A 


35 


32. 


. 946 


43. 


.675 


-24, 


.118 


1. 


,00 


0, 


.00 


c 


ATOM 


37 


CG2 


VAL 


A 


35 


35. 


,181 


44 . 


.222 


-23. 


.098 


1. 


,00 


0. 


.00 


c 


ATOM 


38 


N 


GLN 


A 


36 


32. 


.688 


40. 


.584 


-22. 


.417 


1. 


,00 


0. 


.00 


N 


ATOM 


39 


CA 


GLN 


A 


36 


31. 


.668 


39. 


.594 


-22, 


.768 


1. 


,00 


0, 


.00 


c 


ATOM 


40 


C 


GLN 


A 


36 


31, 


.067 


38, 


.670 


-21. 


.719 


1, 


,00 


0, 


.00 


c 


ATOM 


41 


O 


GLN 


A 


36 


30. 


.128 


37. 


.936 


-22, 


.022 


1. 


,00 


0, 


.00 


0 


ATOM 


42 


CB 


GLN 


A 


36 


32, 


.208 


38, 


.745 


-23, 


.915 


1, 


.00 


0, 


.00 


c 


ATOM 


43 


CG 


GLN 


A 


36 


32, 


.967 


39, 


.567 


-24, 


.944 


1. 


.00 


0, 


.00 


c 


ATOM 


44 


CD 


GLN 


A 


36 


33. 


.436 


38, 


.744 
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